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DNA REPLICATION PROTEINS OF GRAM POSITIVE BACTERIA AND 
THEIR USE TO SCREEN FOR CHEMICAL INHIBITORS 

The present application is a continuation-in-part of U.S. Patent 
Application Serial No. 09/235,245 filed January 22, 1999, which claims benefit of 
U.S. Provisional Patent Application Serial No. 60/093,727 filed July 22, 1998, and 
U.S. Provisional Patent Application Serial No. 60/074,522 filed January 22, 1998, all 
of which are hereby incorporated by reference. The present application also claims 
benefit of U.S. Provisional Patent Application Serial No. 60/146,1 78 filed July 29, 
1 999, which is hereby inc9rporated by reference. 

The present invention was made with fiinding fi-om National Institutes 
of Health Grant No. GM38839. The United States Government may have certain 
rights in this invenrion. 

FIELD OF THE INVENTION 

This invention relates to genes and proteins that replicate the 
chromosome of Gram positive bacteria. These proteins can be used in sequencing, 
amplification of DNA, and in drug discovery to screen large libraries of chemicals for 
identification of compounds with antibiodc acfivity. 

BACKGROUND OF THE INVENTION 

All forms of life must duplicate the genetic material to propagate the 
species. The process by which the DNA in a chromosome is duplicated is called 
replication. The replication process is performed by numerous proteins that 
coordinate their actions to duplicate the DNA smoothly. The main protein actors are 
as follows (reviewed in Romberg et al., DNA Replication. Second Edition, New 
York: W,H. Freeman and Company, pp. 165-194 (1992)). A helicase uses the energy 
of ATP hydrolysis to unwind the two DNA strands of the double helix. Two copies of 
the DNA polymerase use each "daughter" strand as a template to convert them into 
two new duplexes. The DNA polymerase acts by polymerizing the four monomer unit 
building blocks of DNA (the 4 dNTPs, or deoxynucleoside triphosphates are: dATP, 
dCTP, dGTP, dTTP). The polymerase rides along one strand of DNA using it as a 
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template that dictates the sequence in which the monomer blocks are to be 
polymerized. Sometimes the DNA polymerase makes a mistake and includes an 
incorrect nucleotide (e.g., A instead of G). A proofreading exonuclease examines the 
polymer as it is made and excises building blocks that have been improperly inserted 
in the polymer. 

Duplex DNA is composed of two strands that are oriented antiparallel 
to one another, one being oriented 3'-5' and the other 5' to 3'. As the helicase 
unwinds the duplex, the DNA polymerase moves continuously forward with the 
helicase on one strand (called the leading strand). However, due to the fact that DNA 
polymerases can only extend the DNA forward from a 3' terminus, the polymerase on 
the other strand extends DNA in the opposite direction of DNA unwinding (called the 
lagging strand). This necessitates a discontinuous ratcheting motion on the lagging 
strand in which the DNA is made as a series of Okazaki fragments. DNA 
polymerases cannot initiate DNA synthesis de /iovo„but require a primed site (i.e., a 
short duplex region). This job is fulfilled by primase, a specialized RNA polymerase, 
that synthesizes short RNA primers on the lagging strand. The primed sites are 
extended by DNA polymerase. A single-stranded DNA binding protein ("SSB") is 
also needed; it operates on the lagging strand. The function of SSB is to coat single 
stranded DNA ("ssDNA"), thereby melting short hairpin duplexes that would 
otherwise impede DNA synthesis by DNA polymerase. 

The replication process is best understood for the Gram negative 
bacterium Escherichia coli and its bacteriophages T4 and T7 (reviewed in Kelman et 
al., "DNA Polymerase IE Holoenzyme: Structure and Function of Chromosomal 
Replicating Machine," Annu. Rev. Biochem. . 64:171-200 (1995); Marians, K.J., 
"Prokaryotic DNA Replication." Annu. Rev. Biochem 61:673-719 (1992); McHenry, 
C.S., "DNA Polymerase HI Holoenzyme: Components, Structure, and Mechanism of 
a True Replicative Complex," J. Bio. Chem.. 266:19127-19130 (1991); Young et al., 
"Structure and Function of the Bacteriophage T4 DNA Polymerase Holoenzyme." 
Am. Chem. Soc., 31:8675-8690 (1992)). The eukaryotic systems of yeast 
iSaccharomyces cerevisae) (Morrison et al., "A Third Essential DNA Polymerase in 
S. cerevisiaerCd^, 62:1 143-51 (1990) and humans (Bambara et al., "Reconstitution 
of Mammalian DNA Replication." Prog. Nuc. Acid Res. " 51:93-123 (1995)) have 
also been characterized in some detail as has herpes virus (Boehmer et al., "Herpes 
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Simplex Virus DNA Replication " Annu. Rev. Biochem. , 66:347-384 (1997)) and 
vaccinia virus (McDonald et al., "Characterization of a Processive Form of the 
Vaccinia Virus DNA Polymerase " Virology , 234:168-175 (1997)). The helicase ofE. 
coll is encoded by the dnaB gene and is called the DnaB-helicase. In phage T4, the 
5 helicase is the product of the gene 41 , and, in T7, it is the product of gene 4. 

Generally, the helicase contacts the DNA polymerase in E, colt. This contact is 
necessary for the helicase to achieve the catalytic efficiency needed to replicate a 
chromosome (Kim et al., "Coupling of a Replicative Polymerase and Helicase: A tau- 
DnaB Interaction Mediates Rapid Replication Fork Movement," Cell , 84:643-650 

10 (1 996)). The identity of the helicase that acts at the replication fork in a eukaryotic . 

cellular system is still not firm. 

The primase of £. coli (product of the dnaG gene), phage T4 (product 
of gene 61), and T7 (gene 4) require the presence of their cognate hehcase for activity. 
The primase of eukaryotes, called DNA polymerase alpha, looks and behaves 

15 differently. DNA polymerase alpha is composed of 4 subunits. The primase activity 
is associated with the two smaller subunits, and the largest subunit is the DNA 
polymerase which extends the product of the priming subunits. DNA polymerase 
alpha does not need a helicase for priming activity on single strand DNA that is not 
coated with binding protein. 

20 The chromosomal replicating DNA polymerase of all these systems, 

prokaryotic and eukaryotic, share the feature that they are processive, meaning they 
remain continuously associated with the DNA template as they link monomer units 
(dNTPs) together. This catalytic efficiency can be manifest in vitro by their ability to 
extend a single primer around a circular ssDNA of over 5,000 nucleotide units in 

25 length. Chromosomal DNA polymerases will be referred to here as replicases to 
distinguish them from DNA polymerases that function in other DNA metabolic 
processes and are far less processive. 

There are three types of replicases known thus far that differ in how 
they achieve processivity and how their subunits are organized. These will be referred 

30 to here as Types I-DI. The Type I is exemplified by the phage T5 replicase, which is 
composed of only one subunit yet is highly processive (Das et ah, "Mechanism of 
Primer-template Dependent Conversion of dNTP-dNMP by T7 DNA Polymerase," J. 
Biol. Chem. . 255:7149-7154 (1980)). It is possible that the T5 enzyme achieves 
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processivity by having a cavity within it for binding DNA, with a domain of the 
protein acting as a lid that opens to accept the DNA and closes to trap the DNA inside, 
thereby keeping the polymerase on DNA during polymerization of dNTPs. Type n is 
exemplified by the replicases of phage T7, herpes simplex virus, and vaccinia virus. 
In these systems, the replicase is composed of two subunits, the DNA polymerase and 
an "accessory protein" which is needed for the polymerase to become highly efficient. 
It is presumed that the DNA polymerase binds the DNA in a groove and that the 
accessory protein forms a cap over the groove, trapping the DNA inside for processive 
action. Type ID is exemplified by the replicases ofE. coli, phage T4, yeast, and 
humans in which there are three separate components, a sliding clamp protein, a 
clamp loader protein complex, and the DNA polymerase. In these systems, the sliding 
clamp protein is an oligomer in the shape of a ring. The clamp loader is a 
multiprotein complex which uses ATP to assemble the clamp around DNA. The 
DNA polymerase then binds the clamp which tethers the polymerase to DNA for high 
processivity. The replicase of the E, coli system contains a fourth component called 
tau that acts as a glue to hold two polymerases and one clamp loader together into one 
structure called Pol m*. In this application, any replicase that uses a minimum of 
three components (i.e., clamp, clamp loader, and DNA polymerase) will be referred to 
as either a three component polymerase, a type m enzyme, or a DNA polymerase lU- 
type replicase. 

The E. coli replicase is also called DNA polymerase III holoenzyme. 
The holoenzyme is a single multiprotein particle that contains all the components; it is 
comprised often different proteins. This holoenzyme is suborganized into four 
fiinctional components called: 1) Pol EI core (DNA polymerase); 2) gamma complex 
or tau/gamma complex (clamp loader); 3) beta subunit (sliding clamp); and 4) tau 
(glue protein). The DNA polymerase m "core" is a tightly associated complex 
containing one each of the following three subunits: 1) the alpha subunit is the actual 
DNA polymerase (129 kDa); 2) the epsilon subunit (28 kDa) contains the 
proofreading 3*-5' exonuclease activitjr, and 3) the theta subunit has an unknown 
function. The gamma complex is the clamp loader and contains the following 
subunits: gamma, delta, delta prime, chi and psi (U.S. Patent No. 5,583,026 to 
O'Donnell). Tau can substitute for gamma, as can a tau/gamma heterooligomer. The 
beta subunit is a homodimer and forms the ring shaped sliding clamp. These 
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components associate to form the holoeiizyme and the entire holoenzyme can be 
assembled in vitro from 10 isolated pure subunits (U.S. Patent No. 5,583,026 to 
ODonnell; U.S. Patent No. 5,668,004 to O'Donnell). The E, coli dnaX gene encodes 
both tau and gamma. Tau is the product of the fiill gene. Gamma is the product of the 
5 first 2/3 of the gene; it is truncated by an efficient translational frameshift that resuhs 

in incorporation of one unique residue followed by a stop codon. 

The tau subunit, encoded by the same gene that encodes gamma 
(dnaX), also acts as a glue to hold two cores together with one gamma complex. This 
subassembly is called DNA polymerase HI star (Pol HI*). One beta ring interacts 

10 with each core in Pol HI* to form DNA polymerase III holoenzyme. 

During replication, the two cores in the holoenzyme act coordinately to 
synthesize both strands of DNA in a duplex chromosome. At the replication fork, 
DNA polymerase HI holoenzyme physically interacts with the DnaB helicase through 
the tau subunit to form a yet larger protein complex termed the **replisome" (Kim et 

15 al., "Coupling of a Replicative Polymerase and Helicase: A tau-DnaB Interaction 

Mediates Rapid Replication Fork Movement," CeU, 84:643-650 (1996); Yuzhakov et 
al., "Replisome Assembly Reveals the Basis for Asymmetric Function in Leading and 
Lagging Strand Replication," CeH, 86:877-886 (1996)). The primase repeatedly 
contacts the helicase during replication fork movement to synthesize RNA primers on 

20 the lagging strand (Marians, K. L, "Prokaryotic DNA Replication," Annu. Rev. 

Biochem., 61:673-719 (1992)). 

Intensive subtyping of prokaryotic cells has now lead to a taxonomic 
classification of prokaryotic cells as eubacteria (true bacteria) to distinguish them 
from archaebacteria. Within eubacteria are many different subcategories of cells, 

25 although they can broadly be subdivided into Gram positive - and Gram negative-like 
cells. Numerous complete and partial genome sequences of prokaryotes have 
appeared in the public databases. 

In the present invention, new genes from the Gram positive bacteria. 
Streptococcus pyogenes (e.g., S. pyogenes) and Staphylococcus aureus (e.g., 5. 

30 aureus) are identified. They are assigned names based on their nearest homology to 
subunits in the E, coli system. The genes encoding E. coli replication proteins are as 
follows: alpha {dnaE); epsilon {dnaQ)\ theta {holE)\ tau (fiill length dnaX)\ gamma 
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(frameshift product of dnaX); delta (holA)\ delta prime (holB); chi (hoIQ; psi (holD); 
beta {dnaN)\ DnaB helicase (dnaB); and primase (dnaG). 

Study of the organisms for which a complete genome sequence is 
available reveals that no organism has identifiable homologues to all the subunits of 
the E. coli three component polymerase, Pol m holoenzyme (see Table 1 below). All 
other organisms lack the 0 subunit {holE), and all except one lack genes encoding 
the X and V/ subunits [holC and holD, respectively) as judged by sequence comparison 
searches. Further, the a and e subunits are fijsed into one large a subunit in some 
organisms (e.g., Gram positive cells) as detailed in (Sanjanwala et al, "DNA 
Polymerase ffl Gene oi Bacillus subtilis;' Proc. Natl. Acad. Sci.. USA . 86:4421-4424 
(1989)). Although all organisms have homologues to t, 5' and SSB, the 5 subunit has 
diverged significantly (either not recognized or nearly not recognized by gene 
searching programs), perhaps even to the point where it is no longer involved in DNA 
replication. The DnaX product also would appear to lack frameshift signals in most 
organisms. This predicts only one protein (tau) will be produced from this gene, 
instead of two as in E. coli. Indeed, this has been shown to be true for the 
Staphylococcus aureus DnaX (U.S. Patent Application Serial No. 09/235,245, which is 
hereby incorporated by reference). Finally, genetic study oi Bacillus subtilis identified 
two genes that do not have counterparts in E. coli {dnaB, not the helicase, and dnaH) as 
well as one other gene, dnal, that is only very distantly related to E. coli dnaC 
(Karamata et al., "Isolation and Genetic Analysis of Temperature-Sensitive Mutants of 
B. subtilis Defense in DNA Synthesis," Molec. Gen. Genet. . 108:277-287 (1970); 
Braund et al., "Nucleotide Sequence of the Bacillus subtilis dnoD Gene," Microb., 
141:321-322 (1995); Hoshino et al., ^'Nucleotide Sequence o{ Bacillus subtilis dnaB: A 
Gene Essential for DNA Replication Initiation and Membrance Attachment," Proc. 
Natl. Acad. Sci. USA," 84:653-657 (1987)). Keeping in mind the apparently random, 
or at least unpredictable process of evolution, it is possible that these apparently new 
genes perform novel functions that may result in a new type of polymerase for 
chromosomal replication. Thus, it seems possible that new proteins may have evolved 
to take the place of x, e, the frameshift product of DnaX, and possibly 6 in other 
eubacteria. These considerations indicate that the three component polymerase of 
different eubacteria may have different structures. That this may be so would not be 
surprising as different bacteria are often less related evolutionarily than plants are to 
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humans. For example, the split between Gram positive and Gram negative bacteria 
occurred about 1 .2 billion years ago. This distant split makes Gram positive cells an 
attractive source to examine how different other eubacterial three component 
polymerases are from the E, coli Pol IH holoenzyme. 

Table 1 



Organism (Order) 


X 




e 


e 


a 




dnaX 




6 




Escherichia coli 
Proteobacteria 


+ 


+ 


+ 


+ 


+ 


+ 


+ 




+ 




Haemophilus influenzae 
Proteobacteria 


+ 




- 


+ 


+ 




+ 


+ 


+ 




Mycoplasma genitalium 
Firmicutes 


— 


— 


- 


- 


+ 


+ 


+ 


+ 


+ 


(weak) 


Synichisystis sp. 
Cyanobacteria 


- 


- 


- 


- 


+ 


+ 


+ 




+ 


(weak) 


Bacillus subtil is 
Firmicutes 


— 


- 


- 


- 




+ 


+ 




+ 


(weak) 


Spirochaetales 










+ 










(weak) 


Aquifex aeolicus 
Aquificales 
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+ 


+ 




+ 


+ 


(weak) 


Mycobacterium tuberculosis 
Firmicutes & Actinobacteria 










+ 


+ 


+ 


+ 




(weak) 


Treponema pallidum 
Spirochaetales 








+ 


+ 


+ 




+ 


+ 


(weak) 


Chlamydia trachomatis 
Chlamydiales 








+ 


+ 


+ 




+ 


+ 


(weak) 


Rickettsia prowazekii 
Proteobacteria 








+ 


+ 


+ 




+ 


+ 


(weak) 


Helicobacter pylori 
Proteobacteria 








+ 


+ 


+ 






+ 


(weak) 


Thermatoga maritima 
Thermotogales 










+ 




+ 


+ 


+ 


(weak) 



The goal of this invention is to learn how to form a functional three 
component polymerase from an organism that is highly divergent from £. coli and 
whether it is as rapid and processive as the £. coli Pol III holoenzyme. Namely, from 
bacteria lacking x. m;, or e, or having a widely divergent 6 subunit, or having only one 
DnaX product, or an a subunit that encompasses both a and e activities. All 
eubacteria for which the entire genome has been sequenced have at least one of these 
differences from E, coli. Many Gram negative bacteria have one or more of these 
differences (e.g., Haemophilus influenzae and Aquifex aeolicus ). Bacteria of the 
Gram positive class have all of these different features. Because of the distant 



wo 01/09164 



-8- 



PCT/USOO/20666 



evolutionaiy split between Gram positive and Gram negative bacteria, their 
mechanisms of replication may have diverged significantly as well. Indeed, 
purification of the replication polymerase from B. subtilis, a Gram positive cell, gives 
only a single subunit polymerase (Barnes et al.. "Purification of DNA Polymerase m 
of Gram-Positive Bacteria," MethodsEnzy, 262:35-42 (1995); Barnes et al., 
"Antibody to B. subtilis DNA Polymerase ffl: Use in Enzyme Purification and 
Examination of Homology Among Replication-specific DNA Polymerases," Nucl. 
A^'<^sRes. , 6:1203-209 (1979); Barnes et al., "DNA Polymerase m of Mycoplasma 
pulmonis: Isolation and Characterization of the Enzyme and its Structural Gene, 
polC," Mol. Microb., 13:843-854, (1994); Low et al., "Purification and 
Characterization of DNA Polymerase m from Bacillus subtilis," J. Biol. Chem. . 
251:131 1-1325 (1976)) instead of a 10 subunit assembly containing the three 
components of a rapidly processive machine as discussed above for Pol III 
holoenzyme fi-om E. coli. This finding suggests a different structural organization of 
the replicase and possibly different functional characteristics as well. 

Although there are many studies of replication mechanisms in 
eukaryotes and, specifically, the Gram negative bacterium E. coli and its 
bacteriophages, there is very little mformation about how Gram positive organisms 
replicate. The Gram positive class of bacteria includes some of the worst human 
pathogens such as Staphylococcus aureus. Streptococcus pneumoniae. Streptococcus 
pyogenes, Enterococcus faecalis, and Mycobacterium tuberculosis (Youmans et al., 
The Biological and Clinical Basis of Infectious Disease (1985)). Until this invention, 
the best characterized Gram positive organism for chromosomal DNA synthesis was 
Bacillus subtilis. Fractionation of B. subtitis has identified three DNA polymerases. 
(Gass et al., "Further Genetic and Enzymological Characterization of the Three 
Bacillus subtilis Deoxyribonucleic Acid Polymerases." J. Biol. Chem. . 248:7688-7700 
(1973); Ganesan et al., "DNA Replication in a Polymerase I Deficient Mutant and the 
Identification of DNA Polymerases H and m in Bacillus subtilis," Biochem. Biop hys. 
Res.Commun. . 50:155-163 (1973)). These polymerases are thought to be analogous 
to the three DNA polymerases of E. coli (DNA polymerases I, H, and HI). Studies in 
B. subtilis have identified a polymerase that appears to be involved in chromosome 
replication and is termed Pol m (Ott et al., "Cloning and Characterization of the po/C 
Region of Bacillus subtilis," J. Bacteriol , 165:951-957 (1986); Barnes et al.. 
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"Localization of the Exonuclease and Polymerase Domains of Bacillus subtilis DNA 
Polymerase III/' Gene, 1 1 1 :43-49 (1992); Barnes et al., "The 3'-5' Exonuclease Site 
of DNA Polymerase m From Gram-positive Bacteria: Definition of a Novel Motif 
Structure," Gene " 165:45-50 (1995) or Barnes et al., "Purification of DNA 
Polymerase m of Gram-positive Bacteria," Methods in Enzv. , 262:35-42 (1995)). The 
B, subtilis Pol m (encoded hy polQ is larger (about 165 kDa) than the E. call alpha 
subunit (about 129 kDa) and exhibits 3'-5' exonuclease activity. The polC gene 
encoding this Pol EI shows weak homology to the genes encoding E. coli alpha and 
the E. coli epsilon subunit. Hence, this long form of the B. subtilis Pol EQ (herein 
referred to as a -large or Pol IH-L) essentially comprises both the alpha and epsilon 
subunits of the E. coli core polymerase. The S. aureus a -large has also been 
sequenced, expressed in E, coli, and purified; it contains DNA polymerase and 3*-5' 
exonuclease activity (Pacitti et al., "Characterization and Overexpression of the Gene 
Encoding Staphylococcus aureus DNA Polymerase ID " Gene , 165:51-56 (1995)). 
Although a -large is essential to cell growth (Clements et al, "Inhibition of Bacillus 
subtilis Deoxyribonucleic Acid Polymerase m by Phenylhydrazinopyrimidines: 
Demonstration of a Drug-induced Deoxyribonucleic Acid-Enzyme Complex," J. Biol. 
Chem., 250:522-526 (1975); Cozzarelli et al., "Mutational Alteraction of Bacillus 
subtilis DNA Polymerase III to Hydroxyphenylazopyrimidine Resistance: Polymerase 
m is Necessary for DNA Replication," Biochem. And Biophv. Res. Commun. . 
51:151-157 (1973); Low et al., "Mechanism of Inhibition of Bacillus subtilis DNA 
Polymerase m by the Arylhydrazinopyrimidine Antimicrobial Agents," Proc. Natl. 
Acad. Sci. USA, 71 :2973-2977 (1974)), there could still be another DNA 
polymerase(s) that is essential to the cell, such as occurs in yeast (Morrison et al., "A 
Third Essential DNA Polymerase in S. cerevisiae'' Cell 62:1 143-1 151 (1990)). 

Purification of a -large from B. subtilis results in only this single 
protein without associated proteins (Barnes et al, "Localization of the Exonuclease 
and Polymerase Domains of Bacillus subtilis DNA Polymerase HI," Gene . 1 1 1 :43-49 
(1992); Barnes et al, "The 3'-5* Exonuclease Site of DNA Polymerase EI From 
Gram-positive Bacteria: Definition of a Novel Motif Structure," Gene " 165:45-50 
(1995) or Barnes et al, 'Turification of DNA Polymerase m of Gram-positive 
Bacteria," Methods in Enzvmol. 262:35-42 (1995)). Hence, it is possible that a -large 
is a member of the Type I repHcase (like T5) in which it is processive on its own 
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without accessory proteins. B, subtilis and S. aureus also have a gene encoding a 
protein that has approximately 30% homology to the beta subunit o(E. coli; however, 
the protein product has not been purified or characterized (Alonso et al., "Nucleotide 
Sequence of the recF Gene Cluster From Staphylococcus aureus and 
Complementation Analysis in Bacillus subtilis recF Mutants," Mol. Gen. Genet. . 
246:680-686 (1995); Alonso et al., 'TsJucleotide Sequence of the recF Gene Cluster 
From Staphylococcus aureus and Complementation Analysis in Bacillus subtilis recF 
Mutants," Mol. Gen. Genet.. 248:635-636 (1995)). Whether this beta subunit has a 
function in replication, a ring shape, or functions as a sliding clamp was not known 
until recently. It was also not known whether it is functional with a -large. Recently, 
It was shown that S, aureus p is functional as a ring, and that it also functions with a - 
large (U.S. Patent Application Serial No. 09/235,245, which is hereby incorporated by 
reference). Further, a fourth DNA polymerase was identified with greater homology 
to E, coli a than a -large. This polymerase, called herein a -small, is shorter than a - 
large and lacks the domain homologous to epsilon. This polymerase also functions 
with the p ring, indicating that it may participate in chromosome replication. Indeed, a 
recent report indicates that a -small is essential for replication in Streptomyces 
coelicolor A3(2) (Flett et al., "A Gram-negative type* DNA Polymerase EI is Essential 
for Replication of the Linear Chromosome of Streptomyces Coelicolor A3(2)," Mol. 
Micro. , 31 :949-958, (1999)). 

As described earlier, purification of the replicase fi-om the Gram 
positive B. subtilis gives only a single subunit Pol m, instead of a multicomponent 
complex. Also, 5. aureus dnaXYi?iS been shown to encode only one subunit (U.S. 
Patent Application Serial No. 09/235,245, which is hereby incorporated by reference). 
Moreover, 5. aureus and B, subtilis lack homologues to x. v, e, and the 6 subunit is 
only weakly homologous to 6 of E, coli (only 28%). Further, they lack a homologue to 
dnaQ encoding e. Instead, they contain this activity (3*-5' exonuclease) in the/?o/C 
gene product which provides the a -large form of a. The e subunit is needed for high 
speed and processivity of the E, coli Pol IH holoenzyme; the a subunit alone is much 
less rapid and processive with the p ring compared to the presence of both 
a and e (Studwell et al., "Processive Replication is Contingent on the Exonuclease 
Subunit of DNA Polymerase HI Holoenzyme," J. Biol Chem . 265: 1 171-1 178 (1990)). 
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Studies using the E, coli p ring (and y complex) show they confer onto 
S. aureus a quite efficient synthesis (U.S. Patent AppHcation Serial No. 09/235,245, 
which is hereby incorporated by reference), but the efficiency is not equal to that of £. 
coli cLt with p (and y complex). This may be due to use of the heterologous 
5 combination of an a subunit from one organism {S. aureus) with the p clamp from 
another (E. coli,). However, it is also possible that S. aureus a simply does not 
fiinction with a p clamp to produce speed and processivity comparable to the E, coli 
polymerase. Also, as described earlier, the a -large subunit of 5. subtilis pxirifies as a 
single subunit, rather than associated with accessory subunits assembled into the three 

10 components of a rapid, processive machine (i.e., like E. coli Pol III holoenzyme). The 
lack of two DnaX products, lack of a multicomponent structure, and lack of gene 
homologues encoding several subunits of the three component, Pol ID, of E. coli brings 
into question whether other types of bacteria, such as Gram positive cells, even have an 
enzyme with similar structure or comparable speed and processivity to that found in 

15 the Gram negative E, coli. 

The lack of gene homologues encoding several subunits of the E. coli 
three component polymerase creates uncertainties with respect to reconstructing a 
rapid and processive polymerase from a Gram positive cell that has characteristics like 
the Pol in system of £*. coli. 

20 The y and 5' proteins are homologous to one another, encoding C-shape 

proteins (Dong et al., "DNA Polymerase III Accessory Proteins," J. Biol. Chem . 
268:1 1758-1 1765, (1993); Guenther et al., "Crystal Structure of the 5' Subunit of the 
Clamp-loader Complex of £. coli DNA Polymerase ED," Cell, 91 :335-345 (1997)). 
The clamp loaders of yeast and humans are composed of five proteins, all of which are 

25 homologous to one another and to y and 6* (Cullman et al., "Characterization of the 
Five Replication Factor C Genes of Saccharomyces Cerevisiae," MoL Cell. Biol. , 
15:4661-4671 (1995)). This provides evidence that a clamp loader can be composed 
entirely of C-shape proteins. Perhaps the Gram positive DnaX-protein (hereafter 
referred to as t) and 5' are sufficient to provide fiinction as a clamp loader. Indeed, the 

30 clamp loader of T4 phage is composed of only two different proteins, gp44/62 

complex (Young et al., "Structure and Function of the Bacteriophage T4 DNA 
Polymerase Holoenzyme," Biochem.. 31:8675-8690 (1992)). This idea is also 
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supported by the presence of only two RFC genes in archaebacteria, suggesting that 
they may utilize two C-shaped proteins for clamp loading, in contrast to yeast and 
humans that use five. With this consideration in mind, genes were identified and 
isolated and the t protein (encoded by dnaX) and 5* (encoded by holB) of another 
Gram positive organism. Streptococcus pyogenes, were expressed and purified. As 
was observed in S. aureus, S. pyogenes dnaX producQS only a single polypeptide. The 
p, encoded by dnaN of S. pyogenes, was also identified, expressed, and purified, as 
were the a -large subunit encoded by polC and the SSB encoded by the ssb gene. 
These proteins were studied for interactions and characterized for their effect on a- 
large. However, the hypothesis was incorrect as t and 5' did not form a t5' complex, 
nor did they assemble p onto DNA or provide stimulation of a when using p on 
primed and SSB coated M13mpl 8 ssDNA. 

In light of the inability of 5. pyogenes t protein and 6' to function as a 
clamp loader, it seemed reasonable to expect that one or more other proteins are 
needed. The fact that E. coli has some replicase subunits that other bacteria do not, 
suggests that other bacteria may have some replicase subunits that coli does not. 
Indeed, genetic studies of Bacillus subtilis demonstrates that it has three genes needed 
for replication that E, coli does not have. Two of these novel genes, called dnaB (not 
the same as E. coli dnaB encoding the helicase) and dnaH, have no significant 
homology to genes in the E, coli genome database (Bruand et al., "Nucleotide 
Sequence of the Bacillus subtilis dnaP gene/' Microbiol .. 141:321-322 (1995); 
Hoshino et al., "Nucleotide Sequence of Bacillus subtilis dnaB: A gene Essential for 
DNA replication Initiation and Membrane Attachment," Proc. Natl. Acad. Sci. USA. 
84:653-657 (1987)). Further, dnal of B, subtilis is important for replication and has, 
at best, a very limited homology to E. coli dnaC (Karamata et al., "Isolation and 
Genetic Analysis of Temperature-Sensitive Mutants of 5. subtilis Defective in DNA 
synthesis," Molec. Gen. Genetics . 108:277-287 (1970)). Perhaps one or more of these 
genes encode the proteins(s) necessary to provide clamp loading activity when 
combined with t and 6*, or to couple with a to provide it with speed and/or 
processivity as the E. coli epsilon does. The S, pyogenes homologues of 5. subtilis 
dnal, dnaH, and dnaB were identified, cloned, and the encoded proteins were 
expressed and purified. However, these proteins failed to provide activity alone or in 
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combinations with S. pyogenes t and 6' in loading S. pyogenes p onto DNA, or in 
stimulating S. pyogenes a -large in combination with p, t, and 6' on SSB coated 
primed M13mpl8 ssDNA. 

Weak homology exists for the holA gene among prokaryotes. This 
weak homologue of holA was identified in S. pyogenes and, then, it was cloned, 
expressed, and the putative 5 was purified. The putative 6 formed an isolatable 
complex with t and 5'. In fact, the t55' complex loaded S, pyogenes p onto DNA, and 
it stimulated S. pyogenes a -large in a p dependent reaction on primed SSB coated 
Ml 3mpl 8 ssDNA. Hence, this protein was the only missing component necessary to 
provide clamp loading activity. Further, a mixture of a with t56*, followed by ion 
exchange chromatography on MonoQ, indicated formation of an axfiS' complex. 
Consistent with this, t appeared to bind a in gel filtration analysis. 

Whether the S, pyogenes three component polymerase can synthesize 
DNA in as rapid and processive of a fashion as the E. coli Pol m holoenzyme three 
component polymerase is very difficult to predict, because no other DNA polymerase 
known to date catalyzes synthesis at the rate or processivity of the E. coli three 
component polymerase. For example, the three component T4 phage polymerase 
travels about 400 nucleotides/s, the yeast DNA polymerase delta three component 
polymerase travels about 120 nucleotides/s, and the human DNA polymerase delta 
three component enzyme appears slower and less processive than the yeast enzyme. 

The standard test for these speed and processivity characteristics is 
examination of a time course in extension of a primer on a very long template, such as 
around the 7,2 kb M13mpl8 ssDNA genome coated with SSB and primed with a 
synthetic DNA oligonucleotide. The results of experiments of this type demonstrate 
that the three component S, pyogenes polymerase is indeed extremely rapid in 
synthesis. Surprisingly, it is just as fast as the E, coli enzyme. Extension proceeds at 
about 700-800 nucleotides per second, completing the entire template in about 9 
seconds. The enzyme was fiilly processive throughout replication of the M13mpl8 
genome, as could be determined from the fact that some templates were not extended 
at all, while others were extended to completion. If the enzyme had not been 
processive during the entire replication reaction, then when it comes off one partially 
extended DNA genome it would have reassociated with the unextended DNA that 
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remained and partially replicated it as well (and so on until the entire population of 
DNA became ftiliy replicated). This did not happen. Instead, the reaction showed a 
mixture of completely replicated templates and templates that were still untouched 
starting material. This indicates that the enzyme stays with a template until it 
completes it before it cycles over to replicate another one (i.e., it is highly 
processive). Each of the five proteins, a, t, 5. 5' and p, are needed to obtain this rapid 
and processive DNA synthesis. 

This invention has provided an intellectual template by which the 
clamp loader component of these three component polymerases can be obtained from 
any eubacterial prokaryotic cell and how to use it with the other components to 
produce a rapid and processive polymerase. All prokaryotes in the eubacterial 
kingdom that have been sequenced to date contain homologues of these genes. As the 
process of lateral gene transfer appears to be a major force in evolution, it would 
appear that relatedness of enzymes and enzyme machines is best judged by 
comparisons of their genes and proteins rather than by phylogeny of which bacteria 
they are in Poolittle et al, "Archaeal Genomics: Do Archaea have a Mixed 
Heritage?," CuiT. Biol., 8:R209-R21 1 (1998)). As pointed out earlier in this 
application, most bacteria have genetic characteristics of replication genes/proteins of 
S. pyogenes rather than that of coli (i.e., no genes encoding x,^, or 6, only a weak 
homolog to 6, or a dnaX gene encoding only a single protein). 

The dnaX gene encoding t and y in E. coli encodes only one protein in 
some organisms, but, as this application shows, it is still functional in forming a 
protein complex capable of rapid and processive DNA synthesis. In addition, this 
application shows that the delta subunit, which is only weakly homologous among 
different prokaryotic organisms, is an essential functional subunit of the three 
component polymerase (instead of having diverged so as to fulfill an entirely different 
function in some other intracellular process). As mentioned earlier, several genes 
encoding subunits of the E. coli clamp loader (y complex; yA.8',x.^) are not obviously 
presem in other prokaryotes (holC and holD encoding x and y). Hence, one may 
anticipate that other genes may have evolved to encode new subunits that replace 
these, and that these new subunits may have been essential to the activity of the clamp 
loader. For example, they may have either taken over some of the functionality of 
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another subunit, or structurally (e.g., the physical presence of a subunit could be 
needed for one subunit to assume its proper and active conformation, or for one or 
more of the subunits to form a complex together to yield the multisubunit clamp 
loader assembly). In addition, this application shows that the a subunit [polC gene 
5 product) is sufficient for rapid and processive synthesis with the other two 

components (i.e., E. coli requires e submit to bind to a for rapid and processive 
synthesis of a with the p clamp). Finally, this application shows that the S. pyogenes 
three component polymerase synthesizes DNA as fast as the E. coli Pol HI three 
component polymerase. Up to this point, the E. coli Pol in three component 

10 polymerase was over twice the speed of the T4 enzyme and over 5 times the speed of 

others. Hence, it was possible that E. coli may have been unique among prokaryotes 
in having a polymerase that achieves such speed. This invention shows that this is not 
the case. Instead, this speed in polymerization generahzes to the Gram positive 
prokaryotic three component DNA polymerases. It may be presumed, now that two 

15 examples of three component polymerases in widely divergent bacteria share the 

charactistics of rapid, processive synthesis, that the three component polymerase of 
other eubacteria will also be rapid and processive. 

These rapid and processive three component DNA polymerases can be 
applied to several important uses. DNA polymerases currently in use for DNA 

20 sequencing and DNA amplification use enzymes that are much slower and thus could 
be improved upon. This is especially true of amplification as the three component 
polymerase is capable of speed and high processivity making possible amplification of 
very long (tens of Kb to Mb) lengths of DNA in a time efficient manner. These three 
component polymerases also function in conjunction with a replicative helicase 

25 (DnaB) and, thus, are capable of amplification at ambient temperature using the 
helicase to melt the DNA duplex. This property could be useful in amplification 
reaction procedures such as in polymerase chain reaction (PCR) methodology. 
Finally, these three component polymerases and their associated helicase (DnaB) and 
primase (DnaG) are attractive targets for antibiotics due to their essential and central 

30 role in cell viability. 

This application provides a three component polymerase fi"om two 
human pathogens in the Gram positive class. It makes possible the production of this 
three component polymerase firom other bacteria of the Gram positive type (e.g., 
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Streptococci, Staphylococci, Mycoplasma) and other types of bacteria 
lacking x,h/, or 0, those having only one protein produced by their c/woAT gene such as 
obligate intracellular parasites. Mycoplasmas (possibly evolved from Gram positives), 
Cyanobacteria {Synechocystis), Spirochaetes such asBorrelia and Treponemia and 
Chlamydia, and distant relatives of E, coli in the Gram negative class (e.g., Rickettsia 
and Helicobacter), These three component polymerases are useful in manipulation of 
nucleic acids for research and diagnostic purposes (e.g., sequencing and amplification 
methods) and for screening chemicals for antibiotic activity (usefiil in human or 
animal therapy and agriculture such as animal feed supplements). There are several 
assays described previously in U.S. Patent Application Serial No. 09/235,245 to 
O^Donnell et al., which is hereby incorporated by reference, that use these three 
component polymerases (or subassemblies), as well as the DnaB and DnaG 
homologues, either alone or in various combinations, for the purpose of screening 
chemicals, such as chemical libraries, for inhibitor activity. Such inhibitors can be 
developed further (usually by chemical manipulation and alteration) into lead 
compounds and then into full fledged pharmaceuticals. 

There remains a need to understand the molecular details of the process 
of DNA replication in other cells that are quite different from E. coli, such as in Gram 
positive cells. It is possible that a more detailed understanding of replication proteins 
will lead to discovery of new antibiotics. Therefore, a deeper understanding of 
replication proteins of Gram positive bacteria is especially important given the 
emergence of drug resistant strains of these organisms. For example. Staphylococcus 
aureus has successfully mutated to become resistant to all common antibiotics. 

The "target" protein(s) of an antibiotic drug is generally involved in a 
critical cell function, such that blocking its action with a drug causes the pathogenic 
cell to die or no longer proliferate. Current antibiotics are directed to very few targets. 
These include membrane synthesis proteins (e.g.. vancomycin, penicillin, and its 
derivatives such as ampicillin, amoxicillin, and cephalosporin), the ribosome 
machinery (e.g., tetracycline, chloramphenicol, azithromycin, and the aminoglycosides 
such as kanamycin, neomycin, gentamicin, streptomycin), RNA polymerase (e.g., 
rifampimycin), and DNA topoisomerases (e.g., novobiocin, quinolones, and 
fluoroquinolones). The DNA replication apparatus is a crucial life process and, thus, 
the proteins involved in this process are good targets for antibiotics. 
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A powerful approach to discovery of a new drug is to obtain a target 
protein, characterize it, and develop in vitro assays of its cellular function. Large 
chemical libraries can then be screened in the functional assays to identify compounds 
that inhibit the target protein. These candidate pharmaceuticals can then be 
chemically modified to optimize their potency, breadth of antibiotic spectrum, non- 
toxicity, performance in animal models and, finally, clinical trials. The screening of 
large chemical libraries requires a plentiful source of the target protein. An abundant 
supply of protein generally requires overproduction techniques using the gene 
encoding the protein. This is especially true for replication proteins as they are 
present in low abundance in the cell. 

Selective and robust assays are needed to screen reliably a large 
chemical library. The assay should be insensitive to most chemicals in the 
concentration range normally used in the drug discovery process. These assays should 
also be selective and not show inhibition by antibiotics known to target proteins in 
processes outside of replication. 

The present invention is directed to overcoming these deficiencies in 

the art. 

SUMMARY OF THE INVENTION 

The present invention relates to various isolated DNA molecules fi-om 
Staphylococcus aureus and Streptococcus pyogenes, both of which are Gram positive 
bacteria. These include DNA molecules which include a coding region firom the dnaE 
gene (encoding a- small), dnaX geno (encoding tau), polC gene (encoding Pol IE -L 
or a- large), dnaN gene (encoding beta), holA gene (encoding delta), holB gene 
(encoding delta prime), ssb gene (encoding SSB), dnaB gene (encoding DnaB), and 
dnaG gene (encoding DnaG) of S. aureus and/or 5. pyogenes. These DNA molecules 
can be inserted into an expression system and used to transform host cells. The 
isolated proteins or polypeptides encoded by these DNA molecules, and their ability to 
fimction when used in combination is also disclosed. The resulting actions provide 
assembling a ring onto DNA via a clamp loader, and polymerase activity dependent 
on this ring that is rapid and processive. 
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A further aspect of the present invention relates to a method of 
identifying compounds which inhibit activity of a polymerase product of polC or 
dnaE. This method is carried out by forming a reaction mixture comprising a primed 
DNA molecule, a polymerase product of polC or dnaE, a candidate compound, a 
dNTP, and optionally either a beta subunit, a tau complex, or both the beta subunit 
and the tau complex, wherein at least one of the polymerase product of polC or dnaE, 
the beta subunit, the tau complex, or a subunit or combination of subunits thereof is 
derived from a Eubacteria other than Escherichia coli; subjecting the reaction mixture 
to conditions effective to achieve nucleic acid polymerization in the absence of the 
candidate compound; analyzing the reaction mixture for the presence or absence of 
nucleic acid polymerization extension products; and identifying the candidate 
compound in the reaction mixture where there is an absence of nucleic acid 
polymerization extension products. 

The present invention deciphers the structure and mechanism of the 
chromosomal replicase of Gram positive bacteria and other bacteria lacking holQ 
holD, holE or dnaQ genes, or having a dnaX gene that encodes only one protein. 
Rather than use a DNA polymerase that attains high efficiency on its own, or with one 
other subunit, the Gram positive bacteria replicase is a three component type of 
replicase (class IE) that uses a sliding clamp protein. The Gram positive bacteria 
replicase also uses a clamp loader component that assembles the sliding clamp onto 
DNA. This knowledge, and the enzymes involved in the replication process, can be 
used for the purpose of screening for potential antibiotic drugs. Further, information 
about chromosomal replicases may be useful in DNA sequencing, DNA amplification, 
polymerase chain reaction, and other DNA polymerase related techniques. 

The present invention identifies two DNA polymerases (both of Pol III 
type) in Gram positive bacteria that utilize the sliding clamp and clamp loader. The 
present invention also identifies a gene with homology to the alpha subunit ofE. coli 
DNA polymerase HI holoenzyme, the chromosomal replicase ofE, colL These DNA 
polymerases can extend a primer around a large circular natural template when the 
beta clamp has been assembled onto the primed ssDNA by the clamp loader or a 
primer on a linear DNA where the beta clamp may assemble by itself by sliding over 
an end. 
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The present invention shows that the clamp and clamp loader 
components of Gram negative cells can be exchanged for those of Gram positive cells 
in that the clamp, once assembled onto DNA, will function with Pol III obtained from 
either Gram positive and Gram negative sources. This result implies that important 
contacts between the polymerase and clamp have been conserved during evolution. 
Therefore, these "mixed systems" may provide assays for an inhibitor of this 
conserved interaction. Such an inhibitor may be expected to shut down replication, 
and since the interaction is apparently conserved across the evolutionary spectrum 
from Gram positive and Gram negative cells, the inhibitor may exhibit a broad 
spectrum of antibiotic activity. 

The present invention demonstrates that Gram positive bacteria contain 
a beta sub unit that behaves as a sliding clamp that encircles DNA. A dnaX gene 
sequence encoding a protein homolog of the gamma/tau subunit of the clamp loader 
(gamma/tau complex) E. coli DNA polymerase IH holoenzyme is also identified. The 
presence of this gene confirms the presence of a clamp loading apparatus in Gram 
positive bacteria that will assemble beta clamps onto DNA for the DNA polymerases. 

This apphcation also outlines methods and assays for use of these 
replication proteins in drug screening processes. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 shows the construction of the S. aureus Pol III-L expression 
vector. The gene encoding Pol m-L was cloned into a pETl 1 expression vector in a 
three step cloning scheme as illustrated. 

Figures 2A-C describe the expression and purification of S, aureus Pol 
m-L (alpha-large). Figure 2 A compares E, coli cells that contain the pETl IPolC 
expression vector that are either induced or uninduced for protein expression. The gel 
is stained with Coomassie Blue. The induced band corresponds to the expected 
molecular weight of the S. aureus Pol IH-L. and is indicated to the right of the gel. 
Figure 2B shows the results of the MonoQ chromatography of a lysate of E, coli 
(pETl IPolC-L) induced for Pol m-L. The fractions were analyzed in a Coomassie 
Blue stained gel (top) and for DNA synthesis (bottom)^ Fractions containing Pol m-L 
are indicated. In Figure 2C, fractions containing Pol ffl-L from the MonoQ column 
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were pooled and chromatographed on a phosphocellulose column. This shows an 
analysis of the column fractions from the phosphocellulose column in a Coomassie 
Blue stained polyacrylamide gel. The position of Pol IH-L is indicated to the right. 

Figure 3 shows the S. aureus beta expression vector. The dnaN gene 
was amplified from S, aureus genomic DNA and cloned into the pET16 expression 
vector. 

Figures 4A-C illustrate the expression and purification ofS. aureus 
beta. Figure 4A compares E. coli cells that contain the pETI6beta expression vector 
that are either induced or uninduced for protein expression. The gel is stained with 
Coomassie Blue. The induced band corresponds to the expected molecular weight of 
the S, aureus beta, and is indicated to the right of the gel. The migration position of 
size standards are indicated to the left. Figure 4B shows the results of MonoQ 
chromatography of an E, coli (pET16beta) lysate induced for beta. The fractions were 
analyzed in a Coomassie Blue stained gel, and fractions containing beta are indicated. 
In Figure 4C, fractions containing beta from the MonoQ column were pooled and 
chromatographed on a phosphocellulose column. This shows an analysis of the 
column fractions from the phosphocellulose column in a Coomassie Blue stained 
polyacrylamide gel. The position of beta is indicated to the right. 

Figures 5A-B demonstrate that the 5'. aureus beta stimulates 5. aureus 
Pol m-L and E. coli Pol m core on linear DNA, but not circular DNA. In Figure 5A, 
the indicated proteins were added to replication reactions containing polydA-oligodT 
as described in the Examples infra. Amounts of proteins added, when present, were: 
lanes 1,2: 5. aureus Pol HI-L, 7.5 ng; S. aureus beta, 6.2 ng; Lanes 3,4: E, coli Pol m 
core, 45 ng; S. aureus beta, 9.3 fig; Lanes 5,6: E. coli Pol m core, 45 ng; coli beta, 
5ftg. Total DNA synthesis was: Lanes 1-6: 4.4, 30.3, 5.1, 35.5, 0.97, 28.1 pmol, 
respectively. In Figure 5B, Lanes 1-3, the indicated proteins were added to replication 
reactions containing circular singly primed M13mpl8 ssDNA as described in the 
Examples infra, S. aureus beta, 0.8 ^g; S. aureus Pol IH-L, 300 ng (purified through 
MonoQ); E, coli clamp loader complex, 1.7 ng. Results in the E. coli system are 
shown in Lanes 4-6. Total DNA synthesis was: Lanes 1-6: 0.6, 0.36, 0.99, 2.7, 3.5, 
280 pmol, respectively. 

Figure 6 shows that S, aureus Pol HI-L fimctions with E. coli beta and 
clamp loader complex on circular primed DNA. It also shows that 5. aureus beta does 
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not convert Pol DI-L with sufficient processivity to extend the primer all the way 
around a circular DNA. Replication reactions were performed on the circular singly 
primed M13mpl 8 ssDNA. Proteins added to the assay are as indicated in this figure. 
The amount of each protein, when present, is: S, aureus beta, 800 ng; S. aureus Pol 
m-L, 1500 ng (MonoQ fraction 64); E, coli Pol HI core, 450 ng; E. coli beta, 100 ng; 
E, coli gamma complex, 1720 ng. Total DNA synthesis in each assay is indicated at 
the bottom of the figure. 

Figures 7A-B show that 5. aureus contains four distinct DNA 
polymerases. Four different DNA polymerases were partially purified from S. aureus 
cells. S. aureus cell lysate was separated from DNA and, then, chromatographed on a 
MonoQ column. Fractions were analyzed for DNA polymerase activity. Three peaks 
of activity were observed. The second peak was the largest and was expected to be a 
mixture of two DNA polymerases based on early studies in B. subtilis. 
Chromatography of the second peak on phosphocellulose (Figure 7B) resolved two 
DNA polymerases from one another. 

Figures 8A-B show that S, aureus has two DNA Pol El's. The four 
DNA polymerases partially purified from S. aureus extract, designated peaks l-W in 
Figure 7, were assayed on circular singly primed M13mpl8 ssDNA coated with E. 
coli SSB either in the presence or absence of E. coli beta (50ng) and clamp loader 
complex (50 ng). Each reaction contained 2 |il of the partially pure polymerase (Peak 
1 was Mono Q fraction 24 (1 .4 Mg), Peak 2 was phosphocellulose fraction 26 (0.01 6 
mg/ml), Peak 3 was phosphocellulose fraction 46 (0.18 mg/ml), and Peak 4 was 
MonoQ fraction 50 (1 ^g). Figure 8A shows the product analysis in an agarose gel. 
Figure 8B shows the extent of DNA synthesis in each assay. 

Figure 9 compares the homology between the polypeptide encoded by 
dnaE of 5. aureus and other organisms. An alignment is shown for the amino acid 
sequence of the S, aureus dnaE product with the dnaE products (alpha subunits) ofE, 
coli and Salmonella typhimurium. 

Figure 10 compares the homology between the N-terminal regions of 
the gamma/tau polypeptides of S. aureus, B. subtilis, and E, coli. The conserved ATP 
site and the cystines forming the zinc fmger are indicated above the sequence. The 
organisms used in the alignment were: E. coli (GenBank); and B. subtilis. 
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Figure 1 ] compares the homology between the DnaB polypeptide of 
S. aureus and other organisms. The organisms used in the alignment were: E, coli 
(GenBank); 5. subtilis\ SalTyp,, {Salmonella typhimurium). 

Figures 12A-B show the alignment of the delta subunit encoded by 
5 holA for E. coli and B, subtilis (Figure 1 2A) and for the delta subunit of 5. subtilis and 
pyogenes (Figure 12B). Figure 12A shows ClustalW generated alignment of 5. 
pyogenes (Gram positive) delta to E,coli (Gram negative) delta. Figure 12B shows 
ClustalW generated alignment of 5. subtilis (Gram positive) delta to S, pyogenes 
(Gram positive) delta. 

10 Figure 1 3 is an image of an autoradiograph of an agarose gel analysis 

of replication products from singly primed, SSB coated M13mpl8 ssDNA using the 
reconstituted S, aureus Pol EI holozyme. Only in the presence of the t55' complex 
does a-large (PolC) function with p to replicate a full circular duplex DNA (RFEI). 

Figure 14 shows a Comassie Blue stained SDS polyacrylamide gel of 

15 the pure S. pyogenes subunits corresponding to alpha-large, alpha-small, dnaX gene 
product (called tau), beta, delta, delta prime, and SSB. The first lane shows the 
position of molecular weight markers. Purified proteins were separated on a 15% 
SDS-PAGE and stained with Coommassie Brilliant Blue R-250. Each lane contains 5 
microgram of each protein. Lane 1, markers; lane 2, alpha-large; lane 3, alpha-small, 

20 lane 4, tau subunit; lane 5, beta subunit; lane 6, delta subunit; lane 7, delta prime 

subunit; lane 8, single strand DNA binding protein. 

Figures 1 5A-C document the ability to reconstitute the t55' complex of 
pyogenes. Proteins were mixed and gel filtered on Superose 6, followed by analysis 
of the column fractions in a SDS polyacrylamide gel. Figure 15A shows a mixture of 

25 t56*. Figure 15B shows a mixture of t6. Figure 15C shows a mixture of t5'. 

Figures 16A-E show that the S. pyogenes xbb* complex can load the 5. 
pyogenes beta clamp onto (circular) DNA. Loading reactions contained 500 fm 
nicked pBSK plasmid, 500 fin either t56' complex, tau, delta, or delta prime, 1pm 32p- 
labelled beta dimer, 8 mM MgCl2, 1 mM ATP. Reaction components were 

30 preincubated for 10 min at 3TC prior to loading onto 5 ml Biogel Al 5M column 
equilibrated with buffer A containing 100 mM NaCL Figure 16A demonstrates the 
ability of t55' complex to load the beta dimer onto a nicked pBSK circular plasmid. 
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Figures I6B-E show the results of using either: beta aJone (Figure 16B); 65' plus p 
(Figure 16C); x, 5 and p (Figure 16D); t, 5' and p (Figure 16E). 

Figures 17A-C show that x and alpha interact. Figure 17A shows the 
result of gel filtration analysis of a mixture of x with alpha-large. Gel filtration 
fractions are analyzed in a SDS polyacrylamide gel. Figures 17B and 17C show the 
results using only x or only alpha-large, respectively. Comparison of the elution 
positions of proteins shows that the positions of alpha and tau are shifted toward a 
higher molecular weight complex when they are present together. The fact they do 
not exactly comigrate may indicate that they initially are together in a complex, but 
that the complex dissociates during the time of the gel filtration experiment (over one 
halfhour). 

Figures 18A-B document the ability to reconstitute aLx65' (pol m*) 
complex ofS. pyogenes. Proteins were mixed, preincubated for 20 min at 1 5°C, gel 
filtered on Superose 6, followed by analysis of the column fi-actions in a SDS 
polyacrylamide gel (Figure 18A). Proteins were loaded on a MonoQ column, then 
eluted with a linear gradient of 50-500 mM NaCl, followed by analysis of the column 
fractions in a SDS polyacrylamide gel (Figure 18B). The aLT65' complex migrates 
early. 

Figure 19 illustrates the speed and processivity of the S. pyogenes 
aLx56' (pol in*) complex. The aLx66' (pol m*) complex was incubated with primed 
Ml 3pm 18 ssDNA (coated with S. pyogenes SSB) and only two dhJTPs, then 
replication was initiated upon adding the remaining two dNTPs. Reactions contained 
25 finol singly primed M13mpl8 ssDNA template, 300 finol P2, and either 75 finol or 
250 finol aLx65'. Time points were quenched with SDS/EDTA then analyzed in a 
neutral agarose gel followed by autoradiography. Each time point is a separate 
reaction. The time course of polymerization was performed at two different ratios of 
polymerase/primed template to assess speed and processivity of nucleotide 
incorporation. 

Figures 20A-I show the extent of homology between S. pyogenes 
replication genes and other organisms. Due to the low homology of delta 
(Figure 20D), one must "walk" from one organism to the next in order to recognize 
the homologue with high probability. Percent identity over regions of the indicated 
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number of amino acid residues is shown for each match (i.e., the two organisms at the 
opposite ends of each hne). Amino acid sequences were retrieved from either 
GenBank or individual unfinished genome databases. 

Figure 21 A-F are images illustrating that the S. pyogenes DnaE (alpha- 
small) polymerase functions with j3. Figures 21 A-B illustrate the relationship between 
DnaE and p for association with ssDNA, Different amounts of DnaE polymerase 
were added to a SSB coated M13mpl 8 ssDNA circle primed with a single DNA 
ohgonucleotide, and products were analyzed in a native agarose gel. Reactions were 
performed in the presence of t66' and either the absence (Figure 21C, panels 1-4) or 
presence (Figure 2 ID, panels 1-4) of p. Positions of completed duplex (RFII) and 
initial primed template (ssDNA) are indicated. Figure 2 IE shows an analysis of 
exonuclease activity by PolC and DnaE on a 5'-32P-DNA 30-mer. Aliquots were 
removed at the indicated times and analyzed in a sequencing gel. Figure 2 IF shows 
the effect of TMAU on PolC and DnaE in the presence of t56' and p. DNA products 
were analyzed in a native agarose gel. Positions of initial primed MlBmpl 8 (ssDNA) 
and completed circular duplex (RFII) are indicated. 

DETAILED DESCRIPTION OF THE INVENTION 

The present invention relates to various isolated nucleic acid molecules 
from Gram positive bacteria and other bacteria lacking holQ holD, or holE genes or 
having a dnaX gmo encoding only one subunit. These include DNA molecules which 
correspond to the coding regions of the dnaE, dnaX, holA, holB, polQ dnaN, SSB, 
dnaB, and dnaG genes. These DNA molecules can be inserted mto an expression 
system or used to transform host cells. The isolated proteins or polypeptides encoded 
by these DNA molecules and their use to form a three component polymerase are also 
disclosed. Also encompassed by the present invention are corresponding RNA 
molecules transcribed from the DNA molecules. 

These DNA molecules and proteins can be derived from numerous 
bacteria, including Staphylococcus, Streptococcus, Enterococcus, Mycoplasma, 
Mycobacterium, Borrelia, Treponema, Rickettsia, Chlamydia, Helicobacter, and 
Thermatoga, It is particularly directed to such DNA molecules and proteins derived 
from Streptococcus and Staphylococcus bacteria, particularly Streptococcus pyogenes 
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and Staphylococcus aureus (see U.S. Patent Application Serial No. 09/235,245, which 
is hereby incorporated by reference). 

The gene sequences used to obtain DNA molecules of the present 
invention were obtained by sequence comparisons with the E. coli counterparts, 
followed by detailed analysis of the raw sequence data in the contigs from the 5. 
pyogenes database (http://dnal.chem.ou.edu7strep.html) or the S. aureus database 
(http://www.genome.ou.edu/staph.html) to identify the open reading frames. In many 
instances, nucleotide errors were observed causing frameshifls in the open reading 
frame thus truncating it. Therefore, upon cloning the genes via PGR, the genes were 
sequenced to obtain correct information. Also, the full nucleotide sequence of the ssb 
gene was not present in the data base. This was cloned by circular PGR and the full 
sequence is reported below. 

The S, aureus dnaXmd dnaE genes were identified by aligning genes 
of several organisms and designing primers for use in PGR to obtain a gene fragment, 
followed by steps to identify the entire gene. 

One aspect of the present invention relates to a newly discovered Pol 
in gene (herein identified as dnaE) of S. aureus whose encoded protein is homologous 
to E. coli alpha (product of dnaE gene). The partial nucleotide sequence of the S, 
aureus dnaE gene corresponds to SEQ. ID. No. 1 as follows: 



atggtggcat atttaaatat tcatacggct tatgatttgt taaattcaag cttaaaaata 60 
gaagatgccg taagacttgc tgtgtctgaa aatgttgaCg cacttgccat aactgacacc 120 
aatgtattgt atggttttcc taaattttat gatgcatgta tagcaaataa cattaaaccg 180 
atttttggta tgacaatata tgtgacaaat ggattaaata cagtcgaaac agttgtCcta 240 
gctaaaaata atgatggatt aaaagatttg tatcaactat catcggaaat aaaaatgaat 300 
gcattagaac atgtgtcgtt tgaattatta aaacgatttt ctaacaatat gattatcatt 360 
tttaaaaaag tcggtgatca acatcgtgat attgtacaag tgtttgaaac ccataatgac 420 
acatatatgg accaccttag Catttcgatt caaggtagaa aacatgtttg gattcaaaat 4 80 
gtttgttacc aaacacgtca agatgccgat acgatttctg cattagcagc tattagagac 540 
aatacaaaat tagacttaat tcatgatcaa gaagattttg gtgcacattt tttaactgaa 600 
aaggaaatta atcaattaga tattaaccaa gaatatttaa cgcaggttga tgttatagct 660 
caaaagtgtg atgcagaatt aaaatatcat caatctctac ttcctcaata tgagacacct 720 
aatgatgaat cagctaaaaa atatttgtgg cgtgtcttag ttacacaatt gaaaaaatta 780 
gaacttaatt atgacgtcta tttagagcga ttgaaatatg agtataaagt tattactaat 840 
atgggttttg aagattattt cttaatagta agtgatttaa tccattatgc gaaaacgaat 900 
gatgtgatgg taggtcctgg tcgtggttct tcagctggct cactggtcag ttatttattg 960 
ggaattacaa cgattgatcc tattaaattc aatctattat ttgaacgttt tttaaaccca 1020 
gaacgtgtaa caatgcctga tattgatatt gactttgaag atacacgccg agaaagggtc 1080 
attcagtacg tccaagaaaa atatggcgag ctacatgtat ctggaattgt gactttcggt 1140 
catctgcttg caagagcagt tgctagagat gttggaagaa ttatggggtt tgatgaagtt 1200 
acattaaatg aaatctcaag tttaatccca cataaattag gaattacact tgatgaagca 1260 
tatcaaattg acgattttaa agagtttgta catcgaaacc atcgacatga acgctggttc 1320 
agtatttgta aaaagttaga aggtttacca agacatacat ctacacatgc ggcaggaatt 13 80 
attattaatg accatccatt atatgaatat gcccctttaa cgaaagggga tacaggatta 1440 
ttaacgcaat ggacaatgac tgaagccgaa cgtattgggt tatcaaaaat agattttcta 1500 
gggttgagaa acttatcgat tattcatcaa atcttaacac aagtcaaaaa agatttaggt 1560 



wo 01/09164 



-26- 



PCT/USOO/20666 



attaatattg atatcgaaaa gattccgttt gatgatcaaa aagtgtttga attgttgtcg 1620 
caaggagata cgactggcat attccaatta gagtctgacg gtgtaagaag tgtattaaaa 1680 
aaattaaagc cggaacactt tgaagatatt gttgctgtaa cttctttgta tagaccaggt 1740 
ccaatggaag aaattccaac ttacattaca agaagacatg atccaagcaa agttcaatat 1800 
ttacatccgc atttagaacc Catattaaaa aatacttacg gtgttattat ttatcaagag 1860 
caaactatgc aaatagcgag cacatctgca aacttcagtt atggtgaagc ggatatttta 1920 
agaagagcaa tgagtaaaaa aaatagagct gttcttgaaa gtgagcgtca acattttata 1980 
gaaggtgcaa agcaaaatgg ttatcacgaa gacattagta agcaaatatt tgatttgatt 2040 
ctgaaatttg ctgattatgg ttttcctaga gcacatgctg tcagctattc taaaattgca 2100 
tacattatga gctttttaaa agtccattat ccaaattatt tttacgcaaa taCtttaagt 2160 
aatgttattg gaagtgagaa gaaaactgct caaatgatag aagaagcaaa aaaacaaggt 2220 
atcactatat tgccaccgaa cattaacgaa agtcattggt tttataaacc ttcccaagaa 2280 
ggcatttatt tatcaattgg tacaattaaa ggtgttggtt atcaaagtgt gaaagtgatt 2340 
gttgatgaac gttatcagaa cggcaaattt aaagatttct ttgattttgc tagacgtata 24 00 
ccgaagagag tcaaaacgag aaagttactt gaagcactga ttttagtggg agcgtttgat 2460 
gcttttggta aaacacgttc aacgttgttg caagctattg atcaagtgtt ggatggcgat 2520 
ttaaacattg aacaagatgg ttttttattt gatattttaa cgccaaaaca gatgtatgaa 2580 
gataaagaag aattgcctga tgcacttatt agtcagtacg aaaaagaata tttaggattt 2640 
tatgtttcgc aacacccagt agataaaaag tttgttgcca aacaatattt aacgatattt 2700 
aaattgagta acgcgcagaa ttataaacct atattagtac agtttgataa agttaaacaa 2760 
attcgaacta aaaatggtca aaatatggca ttcgtcacat taaatgatgg cattgaaact 2820 
ttagatggtg tgattttccc taatcagttt aaaaagtacg aagagttgCt atcacataat 2880 
gacttgttta tagttagcgg gaaatttgac catagaaagc aacaacgtca actaattata 2940 
aatgagattc agacattagc cacttttgaa gaacaaaaat tagcatttgc caaacaaatt 3000 
ataattagaa ataaatcaca aatagatatg tttgaagaga tgattaaagc tacgaaagag 3060 
aatgctaatg atgttgtgtt atccttttat gatgaaacga ttaaacaaat gactacttta 3120 
ggctatatta atcaaaaaga tagtatgttt aataatttta tacaatcctt taaccctagt 3180 
gatattaggc ttata 3195 

^0 The S. aureus dnaE encoded protein, called a-small, has an amino acid 

sequence corresponding to SEQ. ID. No, 2 as follows: 



10 



15 



20 



25 



Met Val Ala Tyr Leu Asn He His Thr Ala Tyr Asp Leu Leu Asn Ser 
15 10 15 

Ser Leu Lys He Glu Asp Ala Val Arg Leu Ala Val Ser Glu Asn Val 
20 25 30 

Asp Ala Leu Ala He Thr Asp Thr Asn Val Leu Tyr Gly Phe Pro Lys 
35 40 45 

Phe Tyr Asp Ala Cys He Ala Asn Asn He Lys Pro He Phe Gly Met 
50 55 60 

45 Thr He Tyr Val Thr Asn Gly Leu Asn Thr Val Glu Thr Val Val Leu 

70 75 80 



35 



40 



50 



55 



60 



Ala Lys Asn Asn Asp Gly Leu Lys Asp Leu Tyr Gin Leu Ser Ser Glu 
85 90 95 

He Lys Met Asn Ala Leu Glu His Val Ser Phe Glu Leu Leu Lys Arg 
100 105 110 

Phe Ser Asn Asn Met He He He Phe Lys Lys Val Gly Asp Gin His 
115 120 125 

Arg Asp He Val Gin Val Phe Glu Thr His Asn Asp Thr Tyr Met Asp 
130 135 140 

His Leu Ser He Ser He Gin Gly Arg Lys His Val Trp He Gin Asn 
145 150 155 160 
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Val Cys Tyr Gin Thr Arg Gin Asp Ala Asp Thr He Ser Ala Leu Ala 
165 170 175 

Ala He Arg Asp Asn Thr Lys Leu Asp Leu He His Asp Gin Glu Asp 
IBO 185 190 

Phe Gly Ala His Phe Leu Thr Glu Lys Glu He Asn Gin Leu Asp He 
195 200 205 

Asn Gin Glu Tyr Leu Thr Gin Val Asp Val He Ala Gin Lys Cys Asp 
210 215 220 



Ala Glu Leu Lys Tyr His Gin Ser Leu Leu Pro Gin Tyr Glu Thr Pro 
225 230 235 



240 



Asn Asp Glu Ser Ala Lys Lys Tyr Leu Trp Arg Val Leu Val Thr Gin 
245 250 255 

Leu Lys Lys Leu Glu Leu Asn Tyr Asp Val Tyr Leu Glu Arg Leu Lys 
260 265 270 

Tyr Glu Tyr Lys Val He Thr Asn Met Gly Phe Glu Asp Tyr Phe Leu 
275 280 285 

He Val Ser Asp Leu He His Tyr Ala Lys Thr Asn Asp Val Met Val 
290 295 300 

Gly Pro Gly Arg Gly Ser Ser Ala Gly Ser Leu Val Ser Tyr Leu Leu 
305 310 315 320 

Gly He Thr Thr He Asp Pro He Lys Phe Asn Leu Leu Phe Glu Arg 
325 330 335 

Phe Leu Asn Pro Glu Arg Val Thr Met Pro Asp He Asp He Asp Phe 
340 345 350 

Glu Asp Thr Arg Arg Glu Arg Val He- Gin Tyr Val Gin Glu Lys Tyr 
355 360 365 

Gly Glu Leu His Val Ser Gly He Val Thr Phe Gly His Leu Leu Ala 
370 375 380 

Arg Ala Val Ala Arg Asp Val Gly Arg He Met Gly Phe Asp Glu Val 
390 395 400 

Thr Leu Asn Glu He Ser Ser Leu He Pro His Lys Leu Gly He Thr 
405 410 415 

Leu Asp Glu Ala Tyr Gin He Asp Asp Phe Lys Glu Phe Val His Arg 
420 425 430 

Asn His Arg His Glu Arg Trp Phe Ser He Cys Lys Lys Leu Glu Gly 
435 440 445 

Leu Pro Arg His Thr Ser Thr His Ala Ala Gly He He He Asn Asp 
450 455 



His Pro Leu Tyr Glu Tyr Ala Pro Leu Thr Lys Gly Asp Thr Gly Leu 
465 470 475 



480 



Leu Thr Gin Trp Thr Met Thr Glu Ala Glu Arg He Gly Leu Leu Lys 
485 490 495 
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lie Asp Phe Leu Gly Leu Arg Asn Leu Ser lie He His Gin He Leu 
500 505 510 

Thr Gin Val Lys Lys Asp Leu Gly He Asn He Asp He Glu Lys He 
515 520 ^ 525 

Pro Phe Asp Asp Gin Lys Val Phe Glu Leu Leu Ser Gin Gly Asp Thr 
530 535 540 

Thr Gly He Phe Gin Leu Glu Ser Asp Gly Val Arg Ser Val Leu Lys 
545 550 555 560 

Lys Leu Lys Pro Glu His Phe Glu Asp He Val Ala Val Thr Ser Leu 
565 570 575 

Tyr Arg Pro Gly Pro Met Glu Glu He Pro Thr Tyr He Thr Arg Arg 
580 585 590 

His Asp Pro Ser Lys Val Gin Tyr Leu His Pro His Leu Glu Pro He 
595 600 605 

Leu Lys Asn Thr Tyr Gly Val He He Tyr Gin Glu Gin He Met Gin 
610 615 620 

He Ala Ser Thr Phe Ala Asn Phe Ser Tyr Gly Glu Ala Asp He Leu 
625 630 635 640 

Arg Arg Ala Met Ser Lys Lys Asn Arg Ala Val Leu Glu Ser Glu Arg 
645 650 655 

Gin His Phe He Glu Gly Ala Lys Gin Asn Gly Tyr His Glu Asp He 
660 665 670 

Ser Lys Gin He Phe Asp Leu He Leu Lys Phe Ala Asp Tyr Gly Phe 
675 680 685 

Pro Arg Ala His Ala Val Ser Tyr Ser Lys He Ala Tyr He Met Ser 
690 695 700 

Phe Leu Lys Val His Tyr Pro Asn Tyr Phe Tyr Ala Asn He Leu Ser 
705 710 715 720 

Asn Val He Gly Ser Glu Lys Lys Thr Ala Gin Met He Glu Glu Ala 
725 730 735 

Lys Lys Gin Gly He Thr He Leu Pro Pro Asn He Asn Glu Ser His 
740 745 750 

Trp Phe Tyr Lys Pro Ser Gin Glu Gly He Tyr Leu Ser He Gly Thr 
755 760 765 

He Lys Gly Val Gly Tyr Gin Ser Val Lys Val He Val Asp Glu Arg 
770 775 780 

Tyr Gin Asn Gly Lys Phe Lys Asp Phe Phe Asp Phe Ala Arg Arg He 
785 790 795 BOO 



Pro Lys Arg Val Lys Thr Arg Lys Leu Leu Glu Ala Leu He Leu Val 
805 810 815 

Gly Ala Phe Asp Ala Phe Gly Lys Thr Arg Ser Thr Leu Leu Gin Ala 
820 825 830 
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He Asp Gin Val Leu Asp Gly Asp Leu Asn He Glu Gin Asp Gly Phe 
035 040 845 

Leu Phe Asp He Leu Thr Pro Lys Gin Met Tyr Glu Asp Lys Glu Glu 

855 860 

Leu Pro Asp Ala Leu He Ser Gin Tyr Glu Lys Glu Tyr Leu Gly Phe 



865 870 875 



880 



Tyr Val Ser Gin His Pro Val Asp Lys Lys Phe Val Ala Lys Gin Tyr 
885 890 895 

Leu Thr He Phe Lys Leu Ser Asn Ala Gin Asn Tyr Lys Pro He Leu 

905 910 

Val Gin Phe Asp Lys Val Lys Gin He Arg Thr Lys Asn Gly Gin Asn 
915 920 

Met Ala Phe Val Thr Leu Asn Asp Gly He Glu Thr Leu Asp Gly Val 
930 935 940 

He Phe Pro Asn Gin Phe Lys Lys Tyr Glu Glu Leu Leu Ser His Asn 
550 955 

Asp Leu Phe He Val Ser Gly Lys Phe Asp His Arg Lys Gin Gin Arg 
965 

Gin Leu He He Asn Glu He Gin Thr Leu Ala Thr Phe Glu Glu Gin 
980 985 

Lys Leu Ala Phe Ala Lys Gin He He He Arg Asn Lys Ser Gin He 
995 1000 1005 

Asp Met Phe Glu Glu Met He Lys Ala Thr Lys Glu Asn Ala Asn Asp 
1010 1015 1020 

val Val Leu Ser Phe Tyr Asp Glu Thr He Lys Gin Met Thr Thr Leu 
1030 1035 1040 

Gly Tyr He Asn Gin Lys Asp Ser Met Phe Asn Asn Phe He Gin Ser 
1045 1050 1055 

Phe Asn Pro Ser Asp He Arg Leu He 
1060 1065 

The present invention also relates to the S. aureus dnaXgene. This 
S. aureus dnaX gene has a partial nucleotide sequence coiiesponding to SEQ. ID. 
No. 3 as follows: 



ttgaattatc aagccttata tcgtatgtac agaccccaaa gtttcgagga tgtcgtcqqa 60 

tatlttttti oto^r'^"' attgcgcaat gcgatttcga aagaalalca gtcgcatgca x'S 

Scaatcaact f^llllT^ ^^^l^""^^^^ aaaacgagta ttgccaaagt gtttgctlaa 180 

^^^^^^^^^^ cactgatgga gaaccttgta acgaatgtca tatttgtaaa 240 

ITrllll^'' aggggactaa ttcagatgtg atagaaattg atgctgctag taataltgjc 300 

Iaaqtt?ata IIT.IT^ tagagacaaa gttaaatatg caccaagtgl atcgaaatat 360 

aaaacattao ^^of^ ggtgcacatg ctaacaacag gtgcttttaa tgccctttta 420 

aagacgttag aagaacctcc agcacacgct atttttatat tggcaacgac agaaccacat 480 

oatcaaftto ^f^^^^'"' ""^999=^ caacgtttCg attttaalgc alttagccta III 

gatcaaattg ttgaacgttt aaaatttgta gcagatgcac aacaaattga atgtgaagat 600 

gaagccttgg catttatcgc taaagcgtct gaagggggta tgcgtgatlc atLagtatt 66^ 
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atggatcagg ctattgcttt cggcgatggc acattgacat tacaagatgc cctaaatgtt 720 

acgggtagcg ttcatgatga agcgttggat cacttgtttg atgatattgt acaaggtgac 780 

gtacaagcat cttctaaaaa ataccatcag tttataacag aaggtaaaga agtgaatcgc 840 

ctaataaatg atatgatcta Ctttgtcaga gatacgatta tgaataaaac atctgagaaa 900 

gatactgagt atcgagcact gatgaactta gaattagata tgttatatca aatgattgat 960 

cttattaatg atacattagt gtcgattcgt tttagtgtga atcaaaacgt tcattttgaa 1020 

gtattgttag taaaattagc tgagcagatt aagggtcaac cacaagtgat tgcgaatgta 1080 

gctgaaccag cacaaattgc ttcatcgcca aacacagatg tattgttgca acgtatggaa 1140 

cagttagagc aagaactaaa aacactaaaa gcacaaggag tgagtgttgc tcctactcaa 1200 

aaatcttcga aaaagcctgc gagaggtata caaaaatcta aaaatgcatt ttcaatgcaa 1260 

caaattgcaa aagtgctaga taaagcgaat aaggcagata tcaaattgtt gaaagatcat 1320 

tggcaagaag tgattgacca tgcccaaaac aatgataaaa aatcactcgt tagtttattg 13 80 

caaaattcgg aacctgtggc ggcaagtgaa gatcacgtcc tCgtgaaatt tgaggaagag 1440 

atccattgtg aaatcgtcaa taaagacgac gagaaacgta gtagtataga aagtgttgta 1500 

tgtaatatcg ttaataaaaa cgttaaagtt gttggtgtac catcagatca atggcaaaga 1550 

gttcgaacgg agtatttaca aaaccgtaaa aacgaaggcg atgatatgcc aaagcaacaa 1620 

gcacaacaaa cagatattgc tcaaaaagca aaagaccttt tcggtgaaga aactgtacat 1680 

gtgatagatg aagagtga -.gag 



The S. aureus dnaX encoded protein (i.e., the tau subunit) has a partial' 
amino acid sequence corresponding to SEQ. ID. No. 4 as follows: 



Leu Asn Tyr Gin Ala Leu Tyr Arg Met Tyr Arg Pro Gin Ser Phe Glu 
^5 10 15 

Asp Val Val Gly Gin Glu His Val Thr Lys Thr Leu Arg Asn Ala He 
20 25 30 

Ser Lys Glu Lys Gin Ser His Ala Tyr He Phe Ser Gly Pro Arq Gly 
35 40 45 

Thr Gly Lys Thr Ser He Ala Lys Val Phe Ala Lys Ala He Asn Cys 
50 55 60 

Leu Asn Ser Thr Asp Gly Glu Pro Cys Asn Glu Cys His He Cys Lys 
70 75 80 

Gly He Thr Gin Gly Thr Asn Ser Asp Val He Glu He Asp Ala Ala 
85 90 95 

Ser Asn Asn Gly Val Asp Glu He Arg Asn He Arg Asp Lys Val Lys 
100 105 110 

Tyr Ala Pro Ser Glu Ser Lys Tyr Lys Val Tyr He He Asp Glu Val 
115 120 125 

His Met Leu Thr Thr Gly Ala Phe Asn Ala Leu Leu Lys Thr Leu Glu 
130 135 

Glu Pro Pro Ala His Ala He Phe He Leu Ala Thr Thr Glu Pro His 
150 155 160 

Lys He Pro Pro Thr He He Ser Arg Ala Gin Arg Phe Asp Phe Lys 
165 170 175 

Ala He Ser Leu Asp Gin He Val Glu Arg Leu Lys Phe Val Ala Asp 
180 185 190 

Ala Gin Gin He Glu Cys Glu Asp Glu Ala Leu Ala Phe He Ala Lys 
195 200 205 
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Ala Ser Glu Gly 
210 

He Ala Phe Gly 
225 

Thr Gly Ser Val 



Val Gin Gly Asp 
260 

Thr Glu Gly Lys 
275 

Val Arg Asp Thr 
290 

Arg Ala Leu Met 
305 

Leu He Asn Asp 



Val His Phe Glu 
340 

Gin Pro Gin Val 
355 

Ser Pro Asn Thr 
370 

Glu Leu Lys Thr 
385 

Lys Ser Ser Lys 



Phe Ser Met Gin 
420 

Asp He Lys Leu 
435 

Gin Asn Asn Asp 
450 

Pro Val Ala Ala 
465 

He His Cys Glu 



Glu Ser Val Val 
500 

Val Pro Ser Asp 
515 

Arg Lys Asn Glu 
530 



Gly Met Arg Asp 
215 

Asp Gly Thr Leu 
230 

His Asp Glu Ala 
245 

Val Gin Ala Ser 



Glu Val Asn Arg 
280 

He Met Asn Lys 
295 

Asn Leu Glu Leu 
310 

Thr Leu Val Ser 
325 

Val Leu Leu Val 



He Ala Asn Val 
360 

Asp Val Leu Leu 
375 

Leu Lys Ala Gin 
390 

Lys Pro Ala Arg 
405 

Gin He Ala Lys 



Leu Lys Asp His 
440 

Lys Lys Ser Leu 
455 

Ser Glu Asp His 
470 

He Val Asn Lys 
485 

Cys Asn He Val 



Gin Trp Gin Arg 
520 

Gly Asp Asp Met 
535 



Ala Leu Ser He 
220 

Thr Leu Gin Asp 
235 

Leu Asp His Leu 
250 

Phe Lys Lys Tyr 
265 

Leu He Asn Asp 



Thr Ser Glu Lys 
300 

Asp Met Leu Tyr 
315 

He Arg Phe Ser 
330 



Lys Leu Ala Glu 
345 

Ala Glu Pro Ala 



Gin Arg Met Glu 
380 

Gly Val Ser Val 
395 

Gly He Gin Lys 
410 

Val Leu Asp Lys 
425 

Trp Gin Glu Val 



Val Ser Leu Leu 
460 

Val Leu Val Lys 
475 

Asp Asp Glu Lys 
490 

Asn Lys Asn Val 
505 

Val Arg Thr Glu 



Pro Lys Gin Gin 
540 



Met Asp Gin Ala 



Ala Leu Asn Val 
240 

Phe Asp Asp He 
255 

His Gin Phe He 
270 

Met He Tyr Phe 
285 

Asp Thr Glu Tyr 



Gin Met He Asp 
320 

Val Asn Gin Asn 
335 

Gin He Lys Gly 
350 

Gin He Ala Ser 
365 

Gin Leu Glu Gin 



Ala Pro Thr Gin 
400 

Ser Lys Asn Ala 
415 

Ala Asn Lys Ala 
430 

He Asp His Ala 
445 

Gin Asn Ser Glu 



Phe Glu Glu Glu 
480 

Arg Ser Ser He 
495 

Lys Val Val Gly 
510 

Tyr Leu Gin Asn 
525 

Ala Gin Gin Thr 
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Asp He Ala Gin Lys Ala Lys Asp Leu Phe Gly Glu Glu Thr Val His 
545 550 555 560 

Val He Asp Glu Glu Glx 
565 

The tau subunit ofS. aureus functions as does both the tau subunit and the gamma 
subunit of £. coli. 

This invention also relates to the partial nucleotide sequence of the 
S. aureus dnaB gene. The partial nucleotide sequence of this dnaB gene corresponds 
to SEQ. K). No. 5 as follows: 



atggatagaa tgtatgagca aaatcaaatg ccgcataaca atgaagctga acagtctgtc 60 
ttaggttcaa ttattataga tccagaattg attaatacta ctcaggaagt tttgcttcct 120 
gagtcgtttt ataggggtgc ccatcaacat attttccgtg caatgatgca cttaaatgaa 180 
gataataaag aaattgatgt tgtaacattg atggatcaat taccgacgga aggtacgttg 240 
aatgaagcgg gtggcccgca atatcttgca gagttatcta caaatgtacc aacgacgcga 300 
aatgttcagt attatactga tatcgtttct aagcatgcat taaaacgtag attgattcaa 360 
actgcagata gtattgccaa tgatggatat aatgatgaac ttgaactaga tgcgatttta 420 
agtgatgcag aacgtcgaat tttagagcta tcatcttctc gtgaaagcga tggctttaaa 480 
gacattcgag.acgtcttagg acaagtgtat gaaacagctg aagagcttga tcaaaatagt 54 0 
ggtcaaacac caggtatacc tacaggatat cgagatttag accaaatgac agcagggttc 600 
aaccgaaatg atttaattat ccttgcagcg cgtccatctg taggtaagac tgcgttcgca 660 
cttaatattg cacaaaaagc tgcaacgcat gaagatatgt atacagttgg tattttctcg 720 
ctagagatgg gtgctgatca gttagccaca cgtatgattt gtagttctgg aaatgttgac 780 
tcaaaccgct taagaacggg tactatgact gaggaagatt ggagtcgttt tactatagcg 840 
gtaggtaaat tatcacgtac gaagattttt attgatgata caccgggtat tcgaattaat 900 
gatttacgtt ctaaatgtcg tcgattaaag caagaacatg gcttagacat gattgtgatt 960 
gactacttac agttgattca aggtagtggt tcacgtgcgt ccgataacag acaacaggaa 1020 
gtttctgaaa tctctcgtac attaaaagca ttagcccgtg aattaaaatg tccagttatc 1080 
gcactaagtc agttatctcg tggtgttgaa caacgacaag ataaacgtcc aatgatgagt 1140 
gatattcgtg aatctggttc gattgagcaa gatgccgata tcgttgcatt cttataccgt 1200 
gatgattact ataaccgtgg cggcgatgaa gatgatgacg atgatggtgg tttcgagcca 1260 
caaacgaatg atgaaaacgg tgaaattgaa attatcattg ctaagcaacg taacggtcca 1320 
acaggcacag ttaagttaca ttttacgaaa caatataata aatttaccga tatcgattat 1380 
gcacatgcag atatgatg i^sq 

The amino acid sequence of 5. aureus DnaB encoded by the dnaB gene 
corresponds to SEQ. ID. No. 6 as follows: 



Met Asp Arg Met Tyr Glu Gin Asn 
1 5 

Glu Gin Ser Val Leu Gly Ser lie 
20 

Thr Thr Gin Glu Val Leu Leu Pro 
35 40 

Gin His lie Phe Arg Ala Met Met 
50 55 

lie Asp Val Val Thr Leu Met Asp 
65 70 



Gin Met Pro His Asn Asn Glu Ala 
10 15 

He He Asp Pro Glu Leu He Asn 
25 30 

Glu Ser Phe Tyr Arg Gly Ala His 
45 

His Leu Asn Glu Asp Asn Lys Glu 
60 

Gin Leu Ser Thr Glu Gly Thr Leu 
75 80 
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Asn Glu Ala Gly Gly Pro Gin Tyr Leu Ala Glu Leu Ser Thr Asn Val 
85 90 95 

Pro Thr Thr Arg Asn Val Gin Tyr Tyr Thr Asp lie Val Ser Lys His 

105 110 

Ala Leu Lys Arg Arg Leu He Gin Thr Ala Asp Ser lie Ala Asn Asp 

120 125 

Gly Tyr Asn Asp Glu Leu Glu Leu Asp Ala lie Leu Ser Asp Ala Glu 
130 135 

Arg Arg He Leu Glu Leu Ser Ser Ser Arg Glu Ser Asp Gly Phe Lys 

155 160 

Asp He Arg Asp Val Leu Gly Gin Val Tyr Glu Thr Ala Glu Glu Leu 
165 170 

Asp Gin Asn Ser Gly Gin Thr Pro Gly He Pro Thr Gly Tyr Arq Asd 
IfiO 185 190 ^ 

Leu Asp Gin Met Thr Ala Gly Phe Asn Arg Asn Asp Leu He He Leu 
195 200 205 

Ala Ala Arg Pro Ser Val Gly Lys Thr Ala Phe Ala Leu Asn He Ala 
210 215 220 

Gin Lys Val Ala Thr His Glu Asp Met Tyr Thr Val Gly He Phe Se 



225 230 



235 240 



Leu Glu Met Gly Ala Asp Gin Leu Ala Thr Arg Met He Cys Ser Ser 
245 250 255 

Gly Asn Val Asp Ser Asn Arg Leu Arg Thr Gly Thr Met Thr Glu Glu 
260 265 270 

Asp Trp Ser Arg Phe Thr He Ala Val Gly Lys Leu Ser Arg Thr Lys 
2*75 280 285 

He Phe He Asp Asp Thr Pro Gly He Arg He Asn Asp Leu Arg Ser 

295 300 

Lys Cys Arg Arg Leu Lys Gin Glu His Gly Leu Asp Met He Val He 

310 315 320 

Asp Tyr Leu Gin Leu He Gin Gly Ser Gly Ser Arg Ala Ser Asp Asn 
325 330 335 

Arg Gin Gin Glu Val Ser Glu He Ser Arg Thr Leu Lys Ala Leu Ala 
340 345 35^^ 

Arg Glu Leu Lys Cys Pro Val He Ala Leu Ser Gin Leu Ser Arg Gly 
355 360 365 

Val Glu Gin Arg Gin Asp Lys Arg Pro Met Met Ser Asp He Arg Glu 
370 375 380 

ser Gly Ser He Glu Gin Asp Ala Asp He Val Ala Phe Leu Tyr Arg 

390 395 

Asp Asp Tyr Tyr Asn Arg Gly Gly Asp Glu Asp Asp Asp Asp Asp Gly 
405 410 415 
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Gly Phe Glu Pro Gin Thr Asn Asp Glu Asn Gly Glu He Glu He Us 



420 42q 

^^=> 430 

Gin Arg Asn Gly Pro Thr Glv Th-r vai i.,,o 

435 



lie Ala Lys Gin Arg Asn Gly Pro Thr Gly Thr Val Lys Leu His Phe 

440 445 

Met Lys Gin Tyr Asn Lys Phe Thr Asp He Asp Tyr Ala His Ala Asp 
450 455 H 



Met Met 
465 



The present invention also relates to the S. aureus polC gene (encoding 
Pol m-L or a-large). The partial nucleotide sequence of this polC gene corresponds 
to SEQ. ED. No. 7 as follows: 



qa?qctoaIa tttr^^^^^ taaagtgctt gctgatcaaa ttaaaatttc aaatcaatta 60 
Icataaaaat aggtgaactg acacgtatag atgtttctaa caaaaacaga 120 

ataaS^^ ttcatattac attaccacaa ttcttagetc atgaagatta tttatCattt 180 
ataaatgcaa tagagcaaga gtttaaagat atcgccaacg ttacatgtcg ttttacggta 240 
acaaatggca cgaatcaaga tgaacatgca attaaatact ttgggcactg tattqaccaa 300 
aa^otattaa ''^TT^^ ^aaaggtcaa ttgaaacaga aall^ctta? t^cg^cCgg^ 360 
aatooaaofo ^^^taatggt atcaaatgac attgaacgta atcattttga taaggcat^t 420 
aaaacaaI^^ ^^f ^^f^'' gtttagaaat tgtggttttg atatcgataa aatcltattc 480 
lll^.llT.l ^'=^^,'9^'^^^ agaacaaaac ttagcttctt tagaagcaca tattcaagaa 54 0 
miT^ aaagtgcacg attggcaaca gagaaacttg aaaaaatgaa agctgaalaa 600 
gcgaaacaac aagacaacaa cgaaagtgct gtcgataagt gtcaaattgg taagccjacc 660 
caaattgaaa atattaaacc aatCgaatct attattgagg aaqagtttal aqttacaata 720 
l^lfSZ III'^IT' ^^^'^^-^ 9aacttLII gtlg^cgc^: tltStaga^ 7 0 
pataat?taa T^r-l.ltT'' 9gactcttta gttttaaaaa tgtttactcg taaaaaciaa 840 
gatgatttag aacattttaa agcgctaagt gttggtaaat gggttagggc tcaaggtcgc 900 
aaaaaao^^^ ^tacatttat tagagattta gttatgatga tglctgltat tgaagaga?t 960 
gcaaScc aaataoa^T .'"^^^ctgaa gaaaagcgtg tagaattcca cttgLtact 1020 
gcaatgagcc aaatggatgg tatacccaat actggtgcgt atgttaaaca ggcagcagac 1080 
tggggacatc cagccattgc ggtcacagac cataatgttg tgcaagcatt tccagatgct 1140 
cacgcagcag cggaaaaaca tggcattaaa atgatatacg gtatggaagg Catgttaitt 1200 
o^^S'^ "ccgattgc atacaaacca caagatgtcl fattlaaagl tgc^actfa^ 1260 
gttgtgttcg acgttgagac aactggttta tcaaatcagt atgataaaat catcgaqctt 1320 
caSqIt ta?c'"''" caa-^gaaatc atcgataagt tt^aaaggtt t^tccg 380 
catgaacgat Catcggaaac gattatcaat ttgacgcata ttactgatqa tatqttaata I44r> 
gatgcccctg agattgaaga agtacttaca gagttfaaag aatgg^ttgg cgatgcglta "oo 
ttcgtagcgc ataatgcttc gtttgatatg ggcttcatc| atac|gga?I tgaacgfctt ^560 
SaS^ catcaacgaa tggtgttatc gatactttag aattltctcg tacga?taat iHl 
actgaatatg gtaaacatgg tttgaattto ttggctaaaa aatatggcgt agaattaaca leao 
^f'T"^ gtgccattta tgaCacagaa gcaacagctt acattftcat afaaatggt? Xlll 
qaaaatacft ^rf'^^^ cgtattaaat cataacgaaa tcaacaaaaa actcagfaat \loo 
gaagatgcat ataaacgtgc aagacctagt catgtcacac taattgtaca aaaccaacaa i860 
ggtcttaaaa atctatttaa aattgtaagt gcatcattgg tgaagtattt ctaccqtaca 1920 
tato^tLf T^"^^" gttagatgaa tatcgtgagg gattLtggt aggtacagcg 
a?tqccaaaf tlTal^ll atgcagaagg accagaglL a^ttgaalal 2040 

actgccaaat attatgattt tattgaaatt caaccaccgg cactttatca aqatttaatt ?ifto 
gatagagagc ttattagaga tactgaaaca ttacatgall tttatcaacg tftaatacat 2^60 
gcaggtgaca cagcgggtat acctgttatt gcgacaggaa atgcacact! tttgcttqaS ^220 
tcalct??^" -^cgcacgtaa aatttcaata gcatcaclac ccggcaatcc act^aatgc 2280 
tcaactttac cggaagcaca ctttagaact acagatgaaa tgttaaacga gtttcatttt 2340 
attoaa^o^^ aaaaagcgca tgaaattgtt gtglaalata claacgaa^t Igcagatcga 2400 
attgaacgcg ttgttcctat taaagatgaa ttatacacac cgcgtatgga aggtgctafc 2460 
gaagaaatta gagaactaag ttatgcaaat gcgcgtaaac tgtltggtla aglcctqcot 2520 
g'^ttfact ta^ttt^:^' agaaaaagaa ttaaaaagta tLtcg^tL tggatt^ 2580 
gtaatttact taatctcgca acgtttagtt aaaaaatcat tagatgatgg atacctaatt Pfioo 
ggttcccgtg gttcagtagg ttctagtttt gtagcgacaa tgactgagit tactgaagta 2700 
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oa^^^!o^f ^^^"^^'^ta tatttgtccg aactgtaaaa cgagtgaatt tttcaatgat 2760 
ggttcagtag gatcaggatt tgatttacct gataagacgt gtgaaacttg tggagcgcca 2820 
cttattaaag aaggacaaga tatCccgttt gaaacatttt taggatttaa ggqaqataaa 2880 
llTtT.'t ^^9^^"^- ctttagtggt gaatatcaac cgaatgccca llLlTclll 2940 
f!^!!^ ^5 ttggtgagga taaagtattc cgtgcaggta caattggCac tgttgctgaa 3000 
aagactgctt ttggttatgt taaaggttat tcgaatgatc aaggtatcca caaaagaggt 3060 
gctgaaatag atcgactcgt taaaggatgt acaggtgcta aacgtacaac tggacagcat 3120 
ccagggggca ttattgtagt acctgattac atggatattt atgattttac gccgatacaa 3180 
tatcctgccg atgatcaaaa ttcagcatgg atgacgacac actttgatct ccattctatt 3240 
catgataatg tatcaaaact tgatatactt ggacacgacg atccaacaat gattcgtatg 3300 
"^f^^ga^^t tgatccaaaa acaatacctg tagatgataa agaagttatg 3360 
Infimi^ 9tacacctga aagtccgggt gttacCgaag atgaaatttt atgtaaaaca 3420 
ggtacatttg gggtaccaga attcggtaca ggactcgtgc gtcaaatgct agaagataca 34 80 
aagccaacaa cattttctga attagttcaa atctcaggat tatctcatgg taca|atgtg 3540 
atgctcaaga attaattaaa accggtatat gtgatttatc aagtgtaatt 3600 
fflltllf^ atgataccac ggtttattta atgtatgctg gttCagaacc atcaatggct 3660 
^r^^^fo^ tggagtcagt acgtaaaggt aaaggtttaa ctgaagaaat gattgaaacg 3720 
atgaaagaaa atgaagtgcc agattggtat ttagattcat gtcttaaaat taagtacatg 3780 
o^f^f^^T cccatgcagc agcatacgtt ttaatggcag tacgtaCcgc atatttcaaa 3840 
gtacatcatc cactttatta ctatgcatct tactttacaa ttcgtgcgtc agactttgat 3900 
ttaatcacga tgattaaaga taaaacaagc attcgaaaca ctgtaaaaga catgtattct 3960 
^tllt^^^^ atctaggtaa aaaagaaaaa gacgcattaa cagtcttgga aattatgaat 4020 
o^^f^?^^^^ atcgaggtta tcgaatgcaa ccgattagtt tagaaaagag tcaggcgtCc 4080 
ttgaaggcga tacacttatt ccgccgttca tatcagtgcc tgggcttggc 4140 
gaaaacgttg cgaaacgaat tgttgaagct cgtgacgatg gcccattttt atcaaaagaa 4200 
gatttaaaca aaaaagctgg attatctcag aaaattattg agtatttaga tgagttaggc 4260 
tcattaccga atttaccaga taaagctcaa ctttcgatat tCgatatg 4308 

The amino acid sequence of the S. aureus polC gene product, a-Iarge, 
corresponds to SEQ. ID. No. 8 as follows: 



Met Thr Glu Gin Gin Lys Phe Lys Val Leu Ala Asp Gin lie Lys He 
1 5 10 15 

Ser Asn Gin Leu Asp Ala Glu He Leu Asn Ser Gly Glu Leu Thr Ara 
20 25 30 

He Asp Val Ser Asn Lys Asn Arg Thr Trp Glu Phe His He Thr Leu 
35 40 45 

Pro Gin Phe Leu Ala His Glu Asp Tyr Leu Leu Phe He Asn Ala He 
=° 55 60 

Glu Gin Glu Phe Lys Asp He Ala Asn Val Thr Cys Arg Phe Thr Val 

'0 75 80 

Thr Asn Gly Thr Asn Gin Asp Glu His Ala He Lys Tyr Phe Gly His 



85 90 



95 



Cys He Asp Gin Thr Ala Leu Ser Pro Lys Val Lys Gly Gin Leu Lys 
100 105 110 

Gin Lys Lys Leu He Met Ser Gly Lys Val Leu Lys Val Met Val Ser 



115 120 



125 



Asn Asp He Glu Arg Asn His Phe Asp Lys Ala Cys Asn Gly Ser Leu 
130 135 

He Lys Ala Phe Arg Asn Cys Gly Phe Asp He Asp Lys He He Phe 
150 155 . 160 
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Glu Thr Asn Asp Asn Asp Gin Glu Gin Asn Leu Ala Ser Leu Glu Ale 



165 



170 



175 



His lie Gin Glu Glu Asp Glu Gin Ser Ala Arg Leu Ala Thr Glu Lys 
180 185 

Leu Glu Lys Met Lys Ala Glu Lys Ala Lys Gin Gin Asp Asn Lys Gin 
195 200 205 

Ser Ala Val Asp Lys Cys Gin He Gly Lys Pro He Gin He Glu Asn 
210 215 220 

lie Lys Pro He Glu Ser He He Glu Glu Glu Phe Lys Val Ala He 
230 235 240 

Glu Gly Val He Phe Asp He Asn Leu Lys Glu Leu Lys Ser Gly Arg 
245 250 255 

His He Val Glu He Lys Val Thr Asp Tyr Thr Asp Ser Leu Val Leu 
260 265 270 

Lys Met Phe Thr Arg Lys Asn Lys Asp Asp Leu Glu His Phe Lys Ala 
275 280 285 

Leu Ser Val Gly Lys Trp Val Arg Ala Gin Gly Arg He Glu Glu 



290 



295 



300 



Asp 



Thr Phe He Arg Asp Leu Val Met Met Met Ser Asp He Glu Glu He 

315 320 

Lys Lys Ala Thr Lys Lys Asp Lys Ala Glu Glu Lys Arg Val Glu Phe 
325 330 

His Leu His Thr Ala Met Ser Gin Met Asp Gly He Pro Asn He Gly 
340 345 350 

Ala Tyr Val Lys Gin Ala Ala Asp Trp Gly His Pro Ala He Ala Val 
355 360 2S5 

Thr Asp His Asn Val Val Gin Ala Phe Pro Asp Ala His Ala Ala Ala 

375 380 

Glu Lys His Gly He Lys Met He Tyr Gly Met Glu Gly Met Leu Val 

395 400 

Asp Asp Gly Val Pro He Ala Tyr Lys Pro Gin Asp Val Val Leu Lys 
405 410 415 

Asp Ala Thr Tyr Val Val Phe Asp Val Glu Thr Thr Gly Leu Ser Asn 
420 425 



430 



Gin Tyr Asp Lys He He Glu Leu Ala Ala Val Lys Val His Asn 



435 



440 



445 



Gly 



Glu He He Asp Lys Phe Glu Arg Phe Ser Asn Pro His Glu Ara Leu 
450 455 ^^rt ^ 



460 



Ser Glu Thr He He Asn Leu Thr His He Thr Asp Asp Met Leu Val 

475 480 

Asp Ala Pro Glu He Glu Glu Val Leu Thr Glu Phe Lys Glu Trp Val 
485 490 495 
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Gly Asp Ala He Phe Val Ala His Asn Ala Ser Phe Asp Met Gly Phe 

505 510 



500 



He Asp Thr Gly Tyr Glu Arg Leu Gly Phe Gly Pro Ser Thr Asn Gly 
515 520 525 ^ 

Val lie Asp Thr Leu Glu Leu Ser Arg Thr He Asn Thr Glu Tyr Gly 
5J0 535 540 

Lys His Gly Leu Asn Phe Leu Ala Lys Lys Tyr Gly Val Glu Leu Thr 



550 



555 



560 



Gin His His Arg Ala He Tyr Asp Thr Glu Ala Thr Ala Tyr He Phe 
565 570 

He Lys Met Val Gin Gin Met Lys Glu Leu Gly Val Leu Asn His Asn 
580 585 



590 



Glu He Asn Lys Lys Leu Ser Asn Glu Asp Ala Tyr Lys Arg Ala Arg 
595 600 605 

Pro ser His Val Thr Leu He Val Gin Asn Gin Gin Gly Leu Lys Asn 

615 620 

Leu Phe Lys He Val Ser Ala Ser Leu Val Lys Tyr Phe Tyr Arg Thr 



630 



635 



640 



Pro Arg He Pro Arg Ser Leu Leu Asp Glu Tyr Arg Glu Gly Leu Leu 
645 650 



655 



Val Gly Thr Ala Cys Asp Glu Gly Glu Leu Phe Thr Ala Val Met Gin 
660 665 



670 



Lys Asp Gin Ser Gin Val Glu Lys He Ala Lys Tyr Tyr Asp Phe He 
675 680 685 

Glu He Gin Pro Pro Ala Leu Tyr Gin Asp Leu He Asp Arg Glu Leu 

He Arg Asp Thr Glu Thr Leu His Glu He Tyr Gin Arg Leu He Hi 
710 715 „ 



s 

720 



Ala Gly Asp Thr Ala Gly He Pro Val He Ala Thr Gly Asn Ala His 
725 730 735 

Tyr Leu Phe Glu His Asp Gly He Ala Arg Lys He Leu He Ala Ser 
740 745 



750 



Gin Pro Gly Asn Pro Leu Asn Arg Ser Thr Leu Pro Glu Ala His Phe 

760 765 

Arg Thr Thr Asp Glu Met Leu Asn Glu Phe His Phe Leu Gly Glu Glu 

775 780 

Lys Ala His Glu He Val Val Lys Asn Thr Asn Glu Leu Ala Asp Arg 
"^50 755 ^ 

He Glu Arg Val Val Pro He Lys Asp Glu Leu Tyr Thr Pro Arg Met 
805 810 815 

Glu Gly Ala Asn Glu Glu He Arg Glu Leu Ser Tyr Ala Asn Ala Arg 



825 



830 
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Lys Leu Tyr Gly Glu Asp Leu Pro Gin He Val He Asp Arg Leu Glu 
835 840 845 

Lys Glu Leu Lys Ser He He Gly Asn Gly Phe Ala Val He Tyr Leu 
850 855 860 

He Ser Gin Arg Leu Val Lys Lys Ser Leu Asp Asp Gly Tyr Leu Val 
870 875 880 

Gly Ser Arg Gly Ser Val Gly Ser Ser Phe Val Ala Thr Met Thr Glu 
885 890 895 

He Thr Glu Val Asn Pro Leu Pro Pro His Tyr He Cys Pro Asn Cys 
900 905 910 

Lys Thr Ser Glu Phe Phe Asn Asp Gly Ser Val Gly Ser Gly Phe Asp 
915 920 925 

Leu Pro Asp Lys Thr Cys Glu Thr Cys Gly Ala Pro Leu He Lys Glu 
930 935 940 

Gly Gin Asp He Pro Phe Glu Lys Phe Leu Gly Phe Lys Gly Asp Lys 
950 955 960 

Val Pro Asp He Asp Leu Asn Phe Ser Gly Glu Tyr Gin Pro Asn Ala 
965 970 975 

His Asn Tyr Thr Lys Val Leu Phe Gly Glu Asp Lys Val Phe Arq Ala 
980 985 990 

Gly Thr He Gly Thr Val Ala Glu Lys Thr Ala Phe Gly Tyr Val Lys 
995 1000 1005 

Gly Tyr Leu Asn Asp Gin Gly He His Lys Arg Gly Ala Glu He Asp 
1010 1015 ^020 

Arg Leu Val Lys Gly Cys Thr Gly Val Lys Ala Thr Thr Gly Gin His 
^^25 1030 1035 1040 

Pro Gly Gly He He Val Val Pro Asp Tyr Met Asp He Tyr Asp Phe 
1045 1050 1055 

Thr Pro He Gin Tyr Pro Ala Asp Asp Gin Asn Ser Ala Trp Met Thr 
1060 1065 1070 

Thr His Phe Asp Phe His Ser He His Asp Asn Val Leu Lys Leu Asp 
1075 1080 1085 

He Leu Gly His Asp Asp Pro Thr Met He Arg Met Leu Gin Asp Leu 
1090 1095 1100 

Ser Gly He Asp Pro Lys Thr He Pro Val Asp Asp Lys Glu Val Met 
1110 1115 1120 

Gin He Phe Ser Thr Pro Glu Ser Leu Gly Val Thr Glu Asp Glu He 
1125 1130 1135 

Leu Cys Lys Thr Gly Thr Phe Gly Val Pro Asn Ser Asp Arg He Arq 
1140 1145 1150 

Arg Gin Met Leu Glu Asp Thr Lys Pro Thr Thr Phe Ser Glu Leu Val 
1155 1160 1165 
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Gin He Ser Gly Leu Ser His Gly Thr Asp Val Trp Leu Gly Asn Ala 
1170 1175 iiQo 

Gin Glu Leu He Lys Thr Gly He Cys Asp Leu Ser Ser Val He Gly 
^^^^ 1190 1195 1200 

Cys Arg Asp Asp He Met Val Tyr Leu Met Tyr Ala Gly Leu Glu Pro 
1205 1210 1215 

Ser Met Ala Phe Lys He Met Glu Ser Val Arg Lys Gly Lys Gly Leu 
1220 1225 1230 

Thr Glu Glu Met He Glu Thr Met Lys Glu Asn Glu Val Pro Asp Trp 
1235 1240 1245 

Tyr Leu Asp Ser Cys Leu Lys He Lys Tyr He Phe Pro Lys Ala His 
1250 1255 1260 

Ala Ala Ala Tyr Val Leu Met Ala Val Arg He Ala Tyr Phe Lys Val 
^265 1270 1275 i280 

His His Pro Leu Tyr Tyr Tyr Ala Ser Tyr Phe Thr He Arg Ala Ser 
1285 1290 1295 

Asp Phe Asp Leu He Thr Met He Lys Asp Lys Thr Ser He Arg Asn 
1300 1305 1310 

Thr Val Lys Asp Met Tyr Ser Arg Tyr Met Asp Leu Gly Lys Lys Glu 
1315 1320 1325 

Lys Asp Val Leu Thr Val Leu Glu He Met Asn Glu Met Ala His Arq 
1330 1335 1340 

Gly Tyr Arg Met Gin Pro He Ser Leu Glu Lys Ser Gin Ala Phe Glu 
1345 1350 1355 1360 

Phe He He Glu Gly Asp Thr Leu He Pro Pro Phe He Ser Val Pro 
1365 1370 1375 

Gly Leu Gly Glu Asn Val Ala Lys Arg He Val Glu Ala Arg Asp Asp 
1380 1385 1390 

Gly Pro Phe Leu Ser Lys Glu Asp Leu Asn Lys Lys Ala Gly Leu Tyr 
1395 1400 1405 

Gin Lys He He Glu Tyr Leu Asp Glu Leu Gly Ser Leu Pro Asn Leu 
1410 1415 1420 

Pro Asp Lys Ala Gin Leu Ser He Phe Asp Met 
1425 1430 1435 

This invention also relates to the S, aureus dnaN gcno encoding the 
beta subunit. The partial nucleotide sequence of this dnaN gene corresponds to SEQ. 
ID. No. 9 as follows: 



atgatggaat tcactattaa aagagattat tttattacac aattaaatga cacattaaaa 

gctatttcac caagaacaac attacctata ttaactggta tcaaaatcga tgcgaaagaa 

catgaagtta tattaactgg ttcagactct gaaatttcaa tagaaatcac tattcctaaa 

actgtagatg gcgaagatat tgtcaatatt tcagaaacag gctcagtagt acttcctgga 
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cgattctttg ttgatattat aaaaaaatta cctggtaaag atgttaaatt atctacaaat 300 

gaacaattcc agacattaat tacatcaggt cattctgaat ttaatttgag tggcttagat 360 

ccagatcaat atcctttatt acctcaagtt tctagagatg acgcaattca attgtcggta 420 

aaagcactta aaaacgtgat tgcacaaacg aattttgcag tgtccacctc agaaacacgc 480 

ccagtactaa ctggtgtgaa ctggcttata caagaaaatg aattaatatg cacagcgact 54 0 

gattcacacc gcttggctgt aagaaagttg cagttagaag atgtttctga aaacaaaaat 600 

gtcatcattc caggtaaggc tttagctgaa ttaaataaaa ttatgtctga caatgaagaa 660 

gacattgata tcttctttgc ttcaaaccaa gttttattta aagttggaaa tgtgaacttt 720 
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Ser His Ala Tyr 
20 

Val Ala lie Asn 
35 

Cys Glu Thr Lys 
50 

Ser Thr Thr Glu 
65 

Arg His Met Asn 



lie Glu Asp Phe 
100 

Ser lie Leu Lys 
115 

Leu Leu Ser Thr 
130 

Cys Gin His Val 
145 

Arg Leu Val Glu 



Thr Tyr Thr Thr 
180 

Asp Leu Leu Ala 
195 

Thr Asn Lys Pro 
210 

Ala Lys Asn Lys 
225 

Phe Phe Glu Asp 



He Tyr Ser Asp 
260 

Ser Phe Asn Gin 
275 

Lys Lys Leu Asn 
290 

Val He Lys Gly 
305 



Leu Phe Glu Gly 



Phe Ala Lys Leu 
40 

Val Ser Thr Tyr 
55 

Asn Ala He Lys 
70 

Gin Leu Pro He 

85 

Glu Asp Phe Glu 



Phe Leu Glu Glu 
120 

Lys Pro Glu Gin 
135 

Tyr Phe Lys Pro 
150 

Gin Asn Met Ser 
165 

Gin He Asp Asn 



Leu Arg Lys Ser 
200 

Met Ala Leu He 
215 

Lys Leu Gin Ser 
230 

He He His Thr 
245 

Leu Lys Asn Asp 



Leu He Leu Met 
280 

Gin Asn Val Asn 
295 

Val Ser 
310 



Asp Asp Ala Gin 
25 

He Leu Cys Gin 



Asn His Pro Asp 
60 

Lys Glu Gin Val 
75 

Glu Ser Thr Asn 
90 

Lys Leu Thr Val 
105 

Pro Pro Asp Asn 



He Leu Asp Thr 
140 

He Asp Lys Glu 
155 

Lys Pro Val Ala 
170 

Ala Met Ala Leu 
185 

Val He Arg Trp 



Gly He He Asp 
220 

Leu Thr He Ala 
235 

Lys Val Asn Val 
250 

He Asp Gin 'Tyr 
265 

Phe Asp Gin Leu 



Pro Thr Leu Val 
300 



Thr Met Lys Gin 
30 

Thr Asp Ser Gin 
45 

Phe Met Tyr He 



Glu Gin Leu Val 
80 

Lys Val Tyr He 
95 

Gin Gly Glu Asn 
110 

Thr He Ala He 
125 

He His Ser Arg 



Lys Phe He Asn 
160 

Glu Met He Ser 
175 

Asn Glu Glu Phe 
190 

Glu Leu Leu Leu 
205 

Leu Leu Lys Gin 



Ala Val Asn Gly 
240 

Glu Asp Lys Gin 
255 

Ala Gin Lys Leu 
270 

Thr Glu Ala His 
285 

Phe Glu Gin He 



This invention also relates to the S. aureus holB gene encoding the 
delta prime subunit. The partial nucleotide sequence of this holB gene corresponds to 
SEQ. ID. No. 13 as follows: 
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atgagcgaca atattgtagc tatttatgga gatgtgcctg aattggttga aaaacaaagt 60 

gcagaaatca tatcacaatt tttgaaaagt gatagagatg actttaactt tgtgaaatat 120 

aatttatacg aaacagagat tgcaccaatt gttgaagaaa cattaacatt gcctttcttt 180 

tcagataaaa aagcaatttt ggttaaaaat gcatatatat ttacaggtga aaaagcgcca 240 

aaagatatgg ctcataatgt agaccaatta atagaattta ttgaaaaata tgatggcgaa 300 

aatttgactg tctttgagat atatcaaaat aaacttgatg aaagaaaaaa gttaactaaa 360 

actctaaaaa aqcatqcaaq qcttaaaaaa ataqaqcaqa tqtcqqaqqa qatcaaqtqq 420 
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Glu Ala Thr Gin Ser Asn Ser Asn Val Gin He Ala Ser Asp Asp Leu 

65 — 



70 



75 



80 



Gin Met lie Glu Met His Glu Leu lie Gin Glu Phe Tyr Tyr Tyr Ala 
85 90 95 

Leu Thr Lys Thr Val Glu Gly Glu Gin Ala Leu Thr Tyr Leu Gin Glu 
100 105 110 

Arg Gly Phe Thr Asp Ala Leu He Lys Glu Arg Gly He Gly Phe Ala 



115 



120 



125 



Pro Asp Ser Ser His Phe Cys His Asp Phe Leu Gin Lys Lys Gly Tyr 
130 135 140 

Asp He Glu Leu Ala Tyr Glu Ala Gly Leu Leu Ser Arg Asn Glu Glu 

150 155 160 

Asn Phe Ser Tyr Tyr Asp Arg Phe Arg Asn Arg He Met Phe Pro Leu 
165 170 

Lys Asn Ala Gin Gly Arg He Val Gly Tyr Ser Gly Arg Thr Tyr Thr 
ISO 185 190 

Gly Gin Glu Pro Lys Tyr Leu Asn Ser Pro Glu Thr Pro He Phe Gin 



195 



200 



205 



Lys Arg Lys Leu Leu Tyr Asn Leu Asp Lys Ala Arg Lys Ser He Arq 
210 215 220 

Lys Leu Asp Glu He Val Leu Leu Glu Gly Phe Met Asp Val He Lys 
230 235 240 

Ser Asp Thr Ala Gly Leu Lys Asn Val Val Ala Thr Met Gly Thr Gin 
245 250 255 

Leu Ser Asp Glu His He Thr Phe He Arg Lys Leu Thr Ser Asn He 
260 265 270 

Thr Leu Met Phe Asp Gly Asp Phe Ala Gly Ser Glu Ala Thr Leu Lys 
275 280 285 

Thr Gly Gin His Leu Leu Gin Gin Gly Leu Asn Val Phe Val He Gin 
290 295 300 

Leu Pro Ser Gly Met Asp Pro Asp Glu Tyr He Gly Lys Tyr Glv Asn 

310 315 320 

Asp Ala Phe Thr Thr Phe Val Lys Asn Asp Lys Lys Ser Phe Ala His 
325" 330 335 

Tyr Lys Val Ser He Leu Lys Asp Glu He Ala His Asn Asp Leu Ser 
340 345 350 

Tyr Glu Arg Tyr Leu Lys Glu Leu Ser His Asp He Ser Leu Met Lys 
355 360 365 

Ser Ser He Leu Gin Gin Lys Ala He Asn Asp Val Ala Pro Phe Phe 
370 375 380 

Asn Val Ser Pro Glu Gin Leu Ala Asn Glu He Gin Phe Asn Gin Ala 

390 395 400 
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Pro Ala Asn Tyr Tyr Pro Glu Asp Glu Tyr Gly Gly Tyr Asp Glu Tyr 
405 410 415 

Gly Gly Tyr He Glu Pro Glu Pro He Gly Met Ala Gin Phe Asp Asn 
420 425 430 

Leu Ser Arg Arg Glu Lys Ala Glu Arg Ala Phe Leu Lys His Leu Met 
435 440 445 
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Lys Val His Ser Val Ser Arg Leu Trp Glu Phe His Phe Ala Phe Ala 
35 40 45 

Ala Val Leu Pro He Ala Thr Tyr Arg Glu Leu His Asp Arg Leu He 
50 55 60 

Arg Thr Phe Glu Ala Ala Asp He Lys Val Thr Phe Asp He Gin Ala 
65 70 75 80 

Ala Gin Val Asp Tyr Ser Asp Asp Leu Leu Gin Ala Tyr Tyr Gin Glu 
85 90 95 

Ala Phe Glu His Ala Pro Cys Asn Ser Ala Ser Phe Lys Ser Ser Phe 
100 105 110 

Ser Lys Leu Lys Val Thr Tyr Glu Asp Asp Lys Leu He He Ala Ala 
115 120 125 

Pro Gly Phe Val Asn Asn Asp His Phe Arg Asn Asn His Leu Pro Asn 
130 135 140 

Leu Val Lys Gin Leu Glu Ala Phe Gly Phe Gly He Leu Thr He Asp 
145 150 155 160 

Met Val Ser Asp Gin Glu Met Thr Glu His Leu Thr Lys Asn Phe Val 
165 170 175 

Ser Ser Arg Gin Ala Leu Val Lys Lys Ala Val Gin Asp Asn Leu Glu 
180 185 190 

Ala Gin Lys Ser Leu Glu Ala Met Met Pro Pro Val Glu Glu Ala Thr 
195 200 205 

Pro Ala Pro Lys Phe Asp Tyr Lys Glu Arg Ala Ala Lys Arg Gin Ala 
210 215 220 

Gly Phe Glu Lys Ala Thr He Thr Pro Met He Glu He Glu Thr Glu 
225 230 235 240 

Glu Asn Arg He Val Phe Glu Gly Met Val Phe Asp Val Glu Arg Lys 
245 250 255 

Thr Thr Arg Thr Gly Arg His He He Asn Phe Lys Met Thr Asp Tyr 
260 265 270 

Thr Ser Ser Phe Ala Leu Gin Lys Trp Ala Lys Asp Asp Glu Glu Leu 
275 280 285 

Arg Lys Phe Asp Met He Ala Lys Gly Ala Trp Leu Arg Val Gin Gly 
290 295 300 

Asn He Glu Thr Asn Pro Phe Thr Lys Ser Leu Thr Met Asn Val Gin 
305 310 315 320 

Gin Val Lys Glu He Val Arg His Glu Arg Lys Asp Leu Met Pro Glu 
325 330 335 

Gly Gin Lys Arg Val Glu Leu His Ala His Thr Asn Met Ser Thr Met 
340 345 350 

Asp Ala Leu Pro Thr Val Glu Ser Leu He Asp Thr Ala Ala Lys Trp 
355 360 365 
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Gly His Lys Ala He Ala He Thr Asp His Ala Asn Val Gin Ser Phe 

375 380 

Pro His Gly Tyr His Arg Ala Arg Lys Ala Gly He Lys Ala He Phe 
390 395 

Gly Leu Glu Ala Asn He Val Glu Asp Lys Val Pro He Ser Tyr Glu 
405 410 
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Gin Pro Leu Val Val Arg Glu Leu He Lys Asp Gin Ala Gly He Glu 

Gin val He Arg Asp Leu He Glu Val Gly Lys Arg Ala Lys Lys Pro 
725 730 735 



Val Leu Ala Thr Gly Aen Val His Tyr Leu Glu Pro Glu Glu Glu He 
740 



Tyr Arg Glu He He Val Arg Ser Leu Gly Gin Gly Ala Met He Asn 
755 760 765 

Arg Thr He Gly Arg Gly Glu Gly Ala Gin Pro Ala Pro Leu Pro Lys 

''0 775 ion. ' 



780 



Leu 
800 



Ala His Phe Arg Thr Thr Asn Glu Met Leu Asp Glu Phe Ala Phe 
790 795 

Gly Lys Asp Leu Ala Tyr Gin Val Val Val Gin Asn Thr Gin Asp Phe 
SOS 810 ai5 

Ala Asp Arg He Glu Glu Val Glu Val Val Lys Gly Asp Leu Tyr Thr 

825 830 

Pro Tyr He Asp Lys Ala Glu Glu Thr Val Ala Glu Leu Thr Tyr Gin 
835 840 

Lys Ala Phe Glu He Tyr Gly Asn Pro Leu Pro Asp He He Asp Leu 

Arg He Glu Lys Glu Leu Thr Ser He Leu Gly Asn Gly Phe Ala Val 
870 875 880 

He Tyr Leu Ala Ser Gin Met Leu Val Asn Arg Ser Asn Glu Arg Gly 
885 890 895 

Tyr Leu Val Gly Ser Arg Gly Ser Val Gly Ser Ser Phe Val Ala Thr 

905 

Met He Gly He Thr Glu Val Asn Pro Met Pro Pro His Tyr Val Cys 

920 925 

Pro ser Cys Gin His Ser Glu Phe He Thr Asp Gly Ser Val Gly Ser 
"° 935 940 

Gly Tyr Asp Leu Pro Asn Lys Pro Cys Pro Lys Cys Gly Thr Pro Tyr 
950 



960 



Gin Lys Asp Gly Gin Asp He Pro Phe Glu Thr Phe Leu Gly Phe Asp 

970 

Gly Asp Lys Val Pro Asp He Asp Leu Asn Phe Ser Gly Asp Asp Gin 

985 990 

Pro Ser Ala His Leu Asp Val Arg Asp He Phe Gly Asp Glu Tyr Ala 
535 1000 1005 

"^^imn ^^"^ "^"^ <31" I-ys Thr Ala Tyr Gly 

1015 1020 

Phe val Lys Gly Tyr Glu Arg Asp Tyr Gly Lys Phe Tyr Arg Asp Ala 
" 1°30 1035 
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Glu Val Asp Arg Leu Ala Ala Gly Ala Ala Gly Val bys Arg Thr Thr 
1045 1050 1055 

Gly Gin His Pro Gly Gly He Val Val He Pro Asn Tyr Met Asp Val 

1065 1070 

Tyr Asp Phe Thr Pro Val Gin Tyr Pro Ala Asp Asp Val Thr Ala Ser 
1075 1080 1085 

"^"^inlo ^^"^ ^^"^ ""^^ Asp Glu Asn Val Leu 

1090 1095 1100 

Lys Leu Asp He Leu Gly His Asp Asp Pro Thr Met He Arg Lys Leu 
1110 1115 1120 

Gin Asp Leu ser Gly He Asp Pro He Thr He Pro Ala Asp Asp Pro 
1125 1130 1135 

Gly Val Met Ala Leu Phe Ser Gly Thr Glu Val Leu Gly Val Thr Pro 
1140 1145 j^i5o 

Glu Gin lie Gly Thr Pro Thr Gly Met Leu Gly He Pro Glu Phe Gly 
1155 1160 1165 ^ 

Thr Asn Phe Val Arg Gly Met Val Asn Glu Thr His Pro Thr Thr Phe 
1170 1175 1180 

Ala Glu Leu Leu Gin Leu Ser Gly Leu Ser His Gly Thr Asp Val Trp 
^^^^ 1150 1195 ^2j,P 

Leu Gly Asn Ala Gin Asp Leu He Lys Glu Gly He Ala Thr Leu Lys 
1205 1210 1215 

Thr Val He Gly Cys Arg Asp Asp He Met Val Tyr Leu Met His Ala 
1220 1225 1230 

Gly Leu Glu Pro Lys Met Ala Phe Thr He Met Glu Arg Val Arg Lys 
1235 1240 1245 

Gly^Leu Trp Leu Lys He^Ser Glu Glu Glu Arg Asn Gly Tyr He Asp 



1260 



Ala^Met Arg Glu Asn Asn Val Pro Asp Trp Tyr He Glu Ser Cys Gly 
1^^° 1275 1280 

Lys He Lys Tyr Met Phe Pro Lys Ala His Ala Ala Ala Tyr Val Leu 
1285 1290 1295 

Met Ala Leu Arg Val Ala Tyr Phe Lys Val His His Pro He Met Tyr 
1300 1305 131Q 

Tyr cys Ala Tyr Phe Ser He Arg Ala Lys Ala Phe Glu Leu Lys Thr 
1315 1320 1325 

Met ser Gly Gly Leu Asp Ala Val Lys Ala Arg Met Glu Asp He Thr 
1330 1335 j3^g 

Ile^Lys Arg Lys Asn^Asn Glu Ala Thr Asn Val Glu Asn Asp Leu Phe 



1350 1355 



1360 



Thr Thr Leu Glu He Val Asn Glu Met Leu Glu Arg Gly Phe Lys Phe 
1365 1370 j3,5 
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Gly Lys Leu Asp Leu Tyr Lys Ser Asp Ala He Glu Phe Gin He Lys 
1380 138S 1390 

Gly Asp Thr Leu He Pro Pro Phe He Ala Leu Glu Gly Leu Gly Glu 
1395 1400 1405 

Asn Val Ala Lys Gin He Val Lys Ala Arg Gin Glu Gly Glu Phe Leu 
1410 1415 1420 

Ser Lys Met Glu Leu Arg Lys Arg Gly Gly Ala Ser Ser Thr Leu Val 
1425 1430 1435 1440 

Glu Lys Met Asp Glu Met Gly He Leu Gly Asn Met Pro Glu Asp Asn 
1445 1450 1455 

Gin Leu Ser Leu Phe Asp Asp Phe Phe 
1460 1465 



The present invention also relates to the dnaE gene of Streptococcus 
pyogenes encoding the a-small subunit. The partial nucleotide sequence of the dnaE 
gene corresponds to SEQ. ID. No. 19 as follows: 



atgtttgctc aacttgatac taaaactgta 
aatcattatt ttgaacgagc aaagcaattt 
gataatcttt atggtgctta ccattttatt 
gttttaggtt tggaaataga gattctctat 
gcccagaata cacaaggcta tcatcagctt 
aagcttcata tggattactt ctgccaacat 
sagggttgga gcgatacatt agtggtccct 
actgatttat ctcatatgga ttctaagagg 
tttgcgcaag atgatatgga aaccctgcac 
ctggcagaga cccctgtggt agaaagtgat 
gccttctatc aaacacactg ccctcaagct 
atctattatg atttcgatac aaatttaaaa 
aagcaagaat tgcaagactt gactgaggct 
ccttatcaat cgcgcttact acatgaattg 
tattttttga ttgtgtggga tttacttcgc 
^tgggacgtg gctcggcggc aggtagtcta 
gatccagttc aacatgattt gctatttgag 
cctgatattg atatcgatct tccagatatt 
aatcgttatg gtagcgacca ttcggcgcaa 
attcgtgatg ttttcaaacg gttcggggtt 
aaaattggtt ttaaagatag cttggctact 
gttattaata gtagaactga atttcaaaag 
aatccaagac aaacgtccat tcacgcagct 
aatcatattc ctctaaaatc gggcgatgac 
gtcgaagcta atggcctgtt aaaaatggat 
caaaaaatgc aagagaaggt tgctaaagac 
gatttagaag acccgcaaac gttggcactt 
caatttgaac aaaatggtgc tattaatctt 
gaaattgttg ccactaccag tctaaataga 
attaaacgaa gagaaggaca agaaaaaatt 
ttagagccaa cttacggtat tatgctttat 
tatgctggtt ttacgttagg caaggccgac 
ctacaagaaa tgcaaaaaat ggaagaagac 
gctgaagaaa cagctagagg actttttaaa 
aaccgcagcc atgcctttgc ctattcagct 
cattacccgg ctgtttttta cgatatcatg 
gatgctctag aatcagattt tcaagtagcg 
gataaaattg aagctagcaa gatttacatg 



tactcattta tggatagttt aattgactta 60 
ggttaccaca ccataggaat catggataag 120 
aaaggttgtc aaaaaaatgg actgcagcca 180 
caagagcggc aggtgctcct taacttaatc 24 0 
ttaaaaattt ccacggcaaa aatgtctggc 300 
ttggaaggga tagcggttat tattcctagt 360 
tttgactact atatgggtgt tgatcagtat 420 
cagcttatac ccctaaggac agttcgttat 480 
atgttgcatg ccattcgaga taacctcagt 540 
caagagttag cagattgtca acaactaacc 600 
ctacagaatt tagaagactt agtgtcagga 660 
ttgcctcatt ttaatagaga taagtctgcc 720 
ggtttgaagg aaaaaggatt gtggaaagag 780 
gtcattattt ctgacatggg ctttgatgat 840 
tttggacgca gtaaaggcta ttatatggga 900 
gtggcttatg ctctgaacat tacagggatt 960 
cgctttttaa acaaagaacg ttatagcatg 1020 
taccgttcag aatttctacg gtatgtccga 1080 
attgtgacct tttcaacctt tggccaggct 1140 
ccagaatacg aactgactaa tctcactaaa 1200 
gtctatgaaa agtcaatctc ttttaggcag 1260 
gcttttgcca ttgccaagcg tatcgaagga 1320 
ggtattgtga tgagtgatga tgccttgacc 1380 
atgatgatca cccagtatga tgctcatgcg 1440 
tttttggggt taagaaattt gacctttgtt 1500 
tacgggtgtc agattgatat tacagccatt 1560 
tttgctaaag gggataccaa gggaattttc 1620 
ttaaaacgga ttaagccaca acgttttgaa 1680 
ccaggggcaa gtgactatac cactaatttc 1740 
gatttgattg atcctgtgat tgctcccatt 1800 
caagaacaag ttatgcagat tgcacaggtt 1860 
ttgttaaggc gtgccatgtc taaaaaaaat 1920 
tttattgctt ctgctaagca cctagggaga 1980 
cggatggaaa aatttgcagg ttatggtttt 204 0 
ttagcttttc aattggctta tttcaaagcc 2100 
atgaattatt ctagcagtga ctatatcaca 2160 
caagttacca ttaatagtat tccttacact 2220 
gggctgaaaa atattaaggg gttgccaagg 2280 
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gattttgctt attggattat cgagcaaaga ccatttaata gcgtagagga ttttctcact 2340 
agaactccag aaaaatatca aaaaaaggtt ttccttgagc ctctgataaa aataggtctg 2400 
tttgattgcc ttgagcctaa ccgtaaaaaa attctggaca acttggatgg tttactggta 2460 
tctgttaatg agcttggttc tcttttttca gattcttcct ttagttgggt agatacgaaa 2520 
gattactcag taactgaaaa atattctttg gaacaggaga tcgttggagt tggcatgagc 2580 
aagcatcctt taattgatat tgctgagaaa agtacccaaa cttttactcc tatttcacag 2640 
ttagtcaaag aaagcgaagc agtcgtactg attcaaatag atagcattag gatcattaga 2700 
accaaaacaa gtgggcagca aatggctttt ttaagtgtga atgacactaa gaaaaagctc 2760 
gatgtcacac tttttccaca agagtatgcc atttataaag accaattaaa agaaggagaa 2820 
ttctattact taaaaggtag aataaaagaa agagaccatc gactgcagat ggtgtgtcag 2880 
caagcgcaaa tggctattag tcaaaaatat tggttattag ttgaaaacca tcagtttgat 2940 
tcccaaattt ctgagatttt aggtgccttt ccaggaacga ctccagttgt tattcactat 3000 
caaaaaaata aggaaacaat tgcattaact aagattcagg ttcatgtaac agagaattta 3060 
aaggaaaaac ttcgtccttt tgttctgaaa acggtttttc ga 3102 

The encoded a-small subunit has an amino acid sequence corresponding to SEQ. ID, 
No. 20 as follows: 



Met Phe Ala Gin Leu Asp Thr Lys Thr Val Tyr Ser Phe Met Asp Ser 
1 5 10 15 

Leu He Asp Leu Asn His Tyr Phe Glu Arg Ala Lys Gin Phe Gly Tyr 
20 25 30 

His Thr He Gly He Met Asp Lys Asp Asn Leu Tyr Gly Ala Tyr His 
35 40 45 

Phe He Lys Gly Cys Gin Lys Asn Gly Leu Gin Pro Val Leu Gly Leu 
50 55 60 

Glu He Glu He Leu Tyr Gin Glu Arg Gin Val Leu Leu Asn Leu He 

70 75 80 

Ala Gin Asn Thr Gin Gly Tyr His Gin Leu Leu Lys He Ser Thr Ala 
85 90 95 

Lys Met Ser Gly Lys Leu His Met Asp Tyr Phe Cys Gin His Leu Glu 
100 105 110 

Gly He Ala Val He He Pro Ser Lys Gly Trp Ser Asp Thr Leu Val 

120 125 

Val Pro Phe Asp Tyr Tyr Met Gly Val Asp Gin Tyr Thr Asp Leu Ser 
130 135 140 

His Met Asp Ser Lys Arg Gin Leu He Pro Leu Arg Thr Val Arg Tyr 

150 155 160 

Phe Ala Gin Asp Asp Met Glu Thr Leu His Met Leu His Ala He Arg 
165 170 175 

Asp Asn Leu Ser Leu Ala Glu Thr Pro Val Val Glu Ser Asp Gin Glu 
180 185 190 

Leu Ala Asp Cys Gin Gin Leu Thr Ala Phe Tyr Gin Thr His Cys Pro 
195 200 205 

Gin Ala Leu Gin Asn Leu Glu Asp Leu Val Ser Gly He Tyr Tyr Asd 
210 215 220 ^ y y 
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Phe Asp 
225 

Lys Gin 

Leu Trp 

He Ser 

Leu Arg 
290 

Ser Ala 
305 

Asp Pro 

Arg Tyr 

Ser Glu 

Ala Gin 
370 

Asp Val 
385 

Thr Lys 
Ser He 
Ala Phe 



He His 
450 



Thr Asn Leu Lys Leu Pro His Phe Asn Arg Asp Lys Ser Ala 
230 235 240 

Glu Leu Gin Asp Leu Thr Glu Ala Gly Leu Lys Glu Lys Gly 
245 250 255 

Lys Glu Pro Tyr Gin Ser Arg Leu Leu His Glu Leu Val He 
260 265 270 

Asp Met Gly Phe Asp Asp Tyr Phe Leu He Val Trp Asp Leu 
275 280 285 

Phe Gly Arg Ser Lys Gly Tyr Tyr Met Gly Met Gly Arg Gly 
295 300 

Ala Gly Ser Leu Val Ala Tyr Ala Leu Asn He Thr Gly He 
310 315 320 

Val Gin His Asp Leu Leu Phe Glu Arg Phe Leu Asn Lys Glu 
325 330 

Ser Met Pro Asp He Asp He Asp Leu Pro Asp He Tyr Arq 
340 345 350 

Phe Leu Arg Tyr Val Arg Asn Arg Tyr Gly Ser Asp His Ser 
355 360 365 

He Val Thr Phe Ser Thr Phe Gly Pro Lys Gin Ala He Arg 
375 380 

Phe Lys Arg Phe Gly Val Pro Glu Tyr Glu Leu Thr Asn Leu 
390 395 400 

Lys He Gly Phe Lys Asp Ser Leu Ala Thr Val Tyr Glu Lys 
405 410 

Ser Phe Arg Gin Val He Asn Ser Arg Thr Glu Phe Gin Lys 
420 425 430 

Ala He Ala Lys Arg He Glu Gly Asn Pro Arg Gin Thr Ser 
435 440 445 

Ala Ala Gly He Val Met Ser Asp Asp Ala Leu Thr Asn His 
455 460 



He Pro 
465 

His Ala 
Arg Asn 
Tyr Gly 



Thr Leu 
530 

Glu Gin 
545 



Leu Lys Ser Gly Asp Asp Met Met He Thr Gin Tyr Asp Ala 
470 475 480 

Val Glu Ala Asn Gly Leu Leu Lys Met Asp Phe Leu Gly Leu 
485 490 

Leu Thr Phe Val Gin Lys Met Gin Glu Lys Val Ala Lys Asp 
500 505 510 

Cys Gin He Asp He Thr Ala He Asp Leu Glu Asp Pro Gin 
515 520 525 

Ala Leu Phe Ala Lys Gly Asp Thr Lys Gly He Phe Gin Phe 
535 540 

Asn Gly Ala He Asn Leu Leu Lys Arg He Lys Pro Gin Arg 
550 555 5gQ 
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Phe Glu Glu lie Val Ala Thr Thr Ser Leu Asn Arg Pro Gly Ala Ser 
565 570 575 

Asp Tyr Thr Thr Asn Phe He Lys Arg Arg Glu Gly Gin Glu Lys He 
580 585 590 

Asp Leu lie Asp Pro Val He Ala Pro He Leu Glu Pro Thr Tyr Gly 
595 600 605 

He Met Leu Tyr Gin Glu Gin Val Met Gin He Ala Gin Val Tyr Ala 
610 615 620 

Gly Phe Thr Leu Gly Lys Ala Asp Leu Leu Arg Arg Ala Met Ser Lys 

630 635 640 

Lys Asn Leu Gin Glu Met Gin Lys Met Glu Glu Asp Phe He Ala Ser 

650 655 

Ala Lys His Leu Gly Arg Ala Glu Glu Thr Ala Arg Gly Leu Phe Lys 
660 665 670 

Arg Met Glu Lys Phe Ala Gly Tyr Gly Phe Asn Arg Ser His Ala Phe 
675 680 685 

Ala Tyr Ser Ala Leu Ala Phe Gin Leu Ala Tyr Phe Lys Ala His Tyr 
690 695 700 

Pro Ala val Phe Tyr Asp He Met Met Asn Tyr Ser Ser Ser Asp Tyr 
■^05 710 715 720 

He Thr Asp Ala Leu Glu Ser Asp Phe Gin Val Ala Gin Val Thr He 
725 730 735 

Asn Ser He Pro Tyr Thr Asp Lys He Glu Ala Ser Lys He Tyr Met 
740 745 750 

Gly Leu Lys Asn He Lys Gly Leu Pro Arg Asp Phe Ala Tyr Trp He 



755 



760 



765 



He Glu Gin Arg Pro Phe Asn . Ser Val Glu Asp Phe Leu Thr Arq Thr 
770 775 780 

Pro Glu Lys Tyr Gin Lys Lys Val Phe Leu Glu Pro Leu He Lys He 
"^Q^ 790 795 800 

Gly Leu Phe Asp Cys Phe Glu Pro Asn Arg Lys Lys He Leu Asp Asn 
805 810 815 

Leu Asp Gly Leu Leu Val Phe Val Asn Glu Leu Gly Ser Leu Phe Ser 
820 825 830 

Asp Ser Ser Phe Ser Trp Val Asp Thr Lys Asp Tyr Ser Val Thr Glu 
835 840 845 

Lys Tyr Ser Leu Glu Gin Glu He Val Gly Val Gly Met Ser Lys His 
850 855 860 



Pro Leu. He Asp He Ala Glu Lys Ser Thr Gin Thr Phe Thr Pro He 
870 875 880 

Ser Gin Leu Val Lys Glu Ser Glu Ala Val Val Leu He Gin He Asp 
885 890 895 
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Ser He Arg lie He Arg Thr Lys Thr Ser Gly Gin Gin Met Ala Phe 
900 905 

Leu Ser Val Asn Asp Thr Lys Lys Lys Leu Asp Val Thr Leu Phe Pro 
915 920 925 

Gin Glu Tyr Ala He Tyr Lys Asp Gin Leu Lys Glu Gly Glu Phe Tvr 

935 ^ 

Tyr Leu Lys Gly Arg He Lys Glu Arg Asp His Arg Leu Gin Met Val 
950 955 960 

Cys Gin Gin Val Gin Met Ala He Ser Gin Lys Tyr Trp Leu Leu Val 
965 970 975 

Glu Asn His Gin Phe Asp Ser Gin He Ser Glu He Leu Gly Ala Phe 
980 985 990 

Pro Gly Thr Thr Pro Val Val He His Tyr Gin Lys Asn Lys Glu Thr 
995 1000 1005 

He Ala Leu Thr Lys He Gin Val Thr Glu Asn Leu Lys Glu Lys Leu 
1010 1015 1020 

Arg Pro Phe Val Leu Lys Thr Val Phe Arq 
1025 .^1030 

The present invention also relates to the holA gene of Streptococcus 
pyogenes encoding the 5 subunit. The holA gene has a nucleotide sequence which 
corresponds to SEQ. ID. No. 21 as follows: 



atgattgcga tagaaaagat tgaaaaactg agtaaagaaa atttgggtct tataaccctt 60 
gtcacaggag atgacattgg tcagtatagc cagttgaaat cccgcttaat ggagcagatt 120 
gcttttgata aggatgattt ggcctattct tactttgata tgtctgaggc cgcttatcag 180 
gatgcagaaa tggatctagt gagcctaccc ttctttgctg agcagaaggt ggttattttt 240 
gaccatttgt tagatatcac gaccaataaa aaaagtttct taaaagaaaa agacctaaag 300 
gcctttgaag cctatttaga aaatccctta gagactactc gactaattat ctttgctcca 360 
.ggtaaattgg atagtaagag acggcttgtt aagcttttga aacgtgatgc ccttgtttta 420 
gaagccaacc ctctgaaaga agcagagcta agaacttatt ttcaaaaata cagtcatcaa 480 
ctgggtttag gtttcgagag tggtgccttt gaccaattac ttttgaaatc aaacgatgat 540 
tttagtcaaa tcatgaaaaa catggccttt ttaaaagcct ataaaaaaac gggaaatatt 600 
agcctaactg atattgagca agccattcct aaaagtttac aagataatat tttcgatctg 660 
actagacttg tcctaggagg taaaattgat gcggctagag atttgattca tgatttacgg 720 
ttatctggag aagatgacat taaattaatc gctatcatgc taggccaatt tcgcttattt 780 
ttgcagctga ctattcttgc tagagatgta aaaaacgagc aacaactagt gattagttta 840 
tcagatattc ttgggcggcg ggttaatcct taccaggtca agtatgcgtt aaaggattct 900 
ctcttgcctt tctaacagga gcggtgaaaa ccttgattga gacagattac 960 
cagataaaaa caggacttta tgagaagagt tatctagttg atattgctct cttaaaaatc 1020 
atgactcact ctcaaaaa 1038 

The encoded 5 subunit has an amino acid sequence corresponding to SEQ. ID. No. 22 
as follows: 



Met He Ala He Glu Lys He Glu Lys Leu Ser Lys Glu Asn Leu Gly 
1 5 10 15 . 
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Leu He Thr Leu Val Thr Gly Asp Asp He Gly Gin Tyr Ser Gin Leu 
2° 25 30 

Lys Ser Arg Leu Met Glu Gin He Ala Phe Asp Lys Asp Asp Leu Ala 
35 40 45 

Tyr Ser Tyr Phe Asp Met Ser Glu Ala Ala Tyr Gin Asp Ala Glu Met 
^° 55 60 

Asp Leu val ser Leu Pro Phe Phe Ala Glu Gin Lys Val Val He Phe 

''O 75 80 

Asp His Leu Leu Asp He Thr Thr Asn Lys Lys Ser Phe Leu Lys Glu 
85 90 95 

Lys Asp Leu Lys Ala Phe Glu Ala Tyr Leu Glu Asn Pro Leu Glu Thr 
100 105 

Thr Arg Leu He He Phe Ala Pro Gly Lys Leu Asp Ser Lys Arg Arg 
115 120 125 

Leu Val Lys Leu Leu Lys Arg Asp Ala Leu Val Leu Glu Ala Asn Pro 
130 135 

Leu Lys Glu Ala Glu Leu Arg Thr Tyr Phe Gin Lys Tyr Ser His Gin 

150 155 3LgQ 

Leu Gly Leu Gly Phe Glu Ser Gly Ala Phe Asp Gin Leu Leu Leu Lys 
165 170 175 

Ser Asn Asp Asp Phe Ser Gin He Met Lys Asn Met Ala Phe Leu Lvs 
180 185 190 

Ala Tyr Lys Lys Thr Gly Asn He Ser Leu Thr Asp He Glu Gin Ala 
195 200 205 

He Pro Lys Ser Leu Gin Asp Asn He Phe Asp Leu Thr Arg Leu Val 
210 215 220 

Leu Gly Gly Lys He Asp Ala Ala Arg Asp Leu He His Asp Leu Arg 

230 235 240 

Leu Ser Gly Glu Asp Asp He Lys Leu He Ala He Met Leu Gly Gin 
245 250 255 

Phe Arg Leu Phe Leu Gin Leu Thr He Leu Ala Arg Asp Val Lys Asn 
260 265 270 

Glu Gin Gin Leu Val He Ser Leu Ser Asp He Leu Gly Arg Arg Val 
275 280 285 

Asn Pro Tyr Gin Val Lys Tyr Ala Leu Lys Asp Ser Arg Thr Leu Ser 

295 300 

Leu Ala Phe Leu Thr Gly Ala Val Lys Thr Leu He Glu Thr Asp Tyr 

310 315 3I0 

Gin He Lys Thr Gly Leu Tyr Glu Lys Ser Tyr Leu Val Asp He Ala 
325 330 335 

Leu Leu Lys He Met Thr His Ser Gin Lys 
340 345 
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10 



15 



20 



25 



30 



35 



40 



50 



55 



The present invention also relates to the holB gene of Streptococcus 
pyogenes encoding the 6* submit. The holB gene has a nucleotide sequence which 
corresponds to SEQ. ID. No. 23 as follows: 



atggatttag cgcaaaaagc tcctaacgct tatcaagctt ttcagacaat tttaaagaaa 60 
gaccgtctga atcatgctta tcttttttcg ggtgattttg ctaatgaaga aatggctctt 120 
tttttagcta aggtcatctt ttgtgaacag aaaaaggatc agacgccctg cgggcattgt 180 
cgctcttgtc aattgattga acaaggagat tttgccgatg tgacggtatt ggaaccaaca 240 
gggcaagtga ttaaaacgga tgtggCcaaa gaaatgatgg ctaacttttc tcagacagga 30O 
tatgaaaaca aacgacaagt ttttattatc aaagattgtg acaaaatgca tatcaatgcc 360 
gctaatagct tgctaaaata cattgaggag cctcagggag aagcttacat atttttattg 420 
accaatgatg ataacaaagt gcttccgacc attaaaagtc ggacacaggt ttttcagttt 480 
cctaaaaacg aagcctatct ttaccaatcg gcacaagaaa agggattatt aaaccatcag 540 
gctaagctag tagccaaact tgccacaaac accagtcatc tagaacgtct gttgcaaacg 600 
agcaagcttt tagaactgat aactcaagca gagcgttttg tatctatttg gctgaaagat 660 
cagttgcagg catatttagc gttgaaccgt ctggtacagt tagcaactga aaaagaagaa 720 
caagatttag ttttgaccct tttgaccttg ctcttggcaa gagagcgtgc gcaaacgcct 780 
ttgacacaat tggaggctgt ctatcaggct aggctcatgt ggcagagcaa tgttaatttt 840 
caaaacacat tagaatatat ggtgatgtca gaa 873 

The encoded 5' subunit has an amino acid sequence corresponding to SEQ. ID. No. 24 
as follows: 

Met Asp Leu Ala Gin Lys Ala Pro Asn Val Tyr Gin Ala Phe Gin Thr 
^5 10 15 

He Leu Lys Lys Asp Arg Leu Asn His Ala Tyr Leu Phe Ser Gly Asp 
20 25 30 

Phe Ala Asn Glu Glu Met Ala Leu Phe Leu Ala Lys Val He Phe Cys 
35 40 45 

Glu Gin Lys Lys Asp Gin Thr Pro Cys Gly His Cys Arg Ser Cys Gin 
50 55 60 

Leu He Glu Gin Gly Asp Phe Ala Asp Val Thr Val Leu Glu Pro Thr 
^5 70 75 80 

Gly Gin Val lie Lys Thr Asp Val Val Lys Glu Met Met Ala Asn Phe 
85 90 95 

Ser Gin Thr Gly Tyr Glu Asn Lys Arg Gin Val Phe He He Lys Asp 
100 105 110 



Cys Asp Lys Met His He Asn Ala Ala Asn Ser Leu Leu Lys Tyr He 
45 115 120 125 



Glu Glu Pro Gin Gly Glu Ala Tyr He Phe Leu Leu Thr Asn Asp Asp 
130 135 140 

Asn Lys Val Leu Pro Thr He Lys Ser Arg Thr Gin Val Phe Gin Phe 
145 150 155 160 

Pro Lys Asn Glu Ala Tyr Leu Tyr Gin Leu Ala Gin Glu Lys Gly Leu 
165 170 175 

Leu Asn His Gin Ala Lys Leu Val Ala Lys Leu Ala Thr Asn Thr Ser 
180 185 190 
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His Leu Glu Arg Leu Leu Gin Thr Ser Lys Leu Leu Glu Leu lie Thr 
155 200 205 

Gin Ala Glu Arg Phe Val Ser lie Trp Leu Lys Asp Gin Leu Gin Ala 
210 215 220 

Tyr Leu Ala Leu Asn Arg Leu Val Gin Leu Ala Thr Glu Lys Glu Glu 

230 235 240 

Gin Asp Leu Val Leu Thr Leu Leu Thr Leu Leu Leu Ala Arg Glu Arg 
245 250 255 

Ala Gin Thr Pro Leu Thr Gin Leu Glu Ala Val Tyr Gin Ala Arg Leu 
260 265 270 

Met Trp Gin Ser Asn Val Asn Phe Gin Asn Thr Leu Glu Tyr Met Val 
275 280 285 

Met Ser Glu 
290 



The present invention also relates to the dnaX gene of Streptococcus 
pyogenes encoding the x subunit. The dnaX gene has a nucleotide sequence which 
corresponds to SEQ. ID. No. 25 as follows: 



atgtatcaag ctctttatcg gaaataccgg 
tcggttattt ccacaacttt aaagcaggca 
cttttttcag gtcctagagg gactgggaaa 
atgaattgtc ctaaccaagt cgatggtgaa 
atcacgaatg gaagcttgga agatgtgatt 
gatgaaattc gtgacattcg agacaaatca 
gtttatatta ttgatgaggt tcacatgtta 
actttggaag aaccgacaga atgttgtctt 
ccagccacta ttttatctcg tgtgcaacgc 
attcgagagc atttagcctg ggttttggac 
ttaaatctca ttgcaaggcg agcagaagga 
caggctttga gcttgtcacc agataatcag 
ggttctattt ccatacttgc tctgggtgac 
acgcaagctc tggcagcctt agagaccatt 
gcgacagatt tattgaccta tctgcgtgat 
caacgtcagt cagctgtttt tgataccaat 
atgataacag ttgttactag tcatctccct 
tatgccgaaa tgatgactat ccaattagct 
ttgtcaggag agttaatctc agagattgaa 
caacaattgt cgcagctcca atcgcgtcct 
cctaaaacca caagctacag ggttgatcgg 
gttcgaaata gccaacaatc tcgacaatat 
attctagata acatttctgc ccaagacaga 
gcaaatagtg agaatgcgat tttggctttc 
agccgaaata atcttaatga tatgtttggt 
cccaatattc tggcagtacc aaggacagat 
caaatgaaat cgcaaaaaga cagtgttcaa 
gaagggtttg attttttgct cgataaaata 



agccaaacgt ttgacgaaat ggtgggacaa 60 
gttgaatctg gcaagattag ccatgcttat 120 
acaagtgcgg caaagatttt tgcaaaggcc 180 
ccctgtaatc aatgcgatat ttgccgagat 24 0 
gaaattgatg ctgcctcgaa taatggtgtt 300 
acctatgcgc caagtcgtgc gacttacaag 360 
tcaacagggg cttttaatgc gcttttgaaa 420 
tatcttggca acaacggaat gcataaaatt 480 
tttgaattca aagctattaa gcaaaaagct 540 
aaagaaggta ttgcctatga ggtggatgct 600 
ggcatgcgtg atgctttatc tattttagat 660 
gtcgccattg caattgccga agaaattaca 720 
tatgttcgat atgtctccca agaacaggct 780 
tatgatagtg ggaagagcat gagccgcttt 840 
ttattggtgg ttaaagctgg cggcgacaat 900 
ttgtctctct cgatagatcg tatattccaa 960 
gaaatcaaaa agggaaccca tcctcggatt 1020 
cagaaagagc agattttgtc ccaagtaaac 1080 
acgctcaaaa atgagttggc acaacttaaa 1140 
gattcactgg caagatctga taaaacgaaa 1200 
gttaccattt tgaaaatcat ggaagaaacg 1260 
ctagatgctc taaaaaatgc ttggaatgaa 1320 
gccttattga tgggctctga gcctgtctta 1380 
gaggctgcct ttaatgcaga acaagtcatg 144 0 
aacattatga gtaaagctgc tggtttttct 1500 
tttcagcata ttcgtaagga atttgctcag 1560 
gaagaacaag aagtagcgct tgatattcca 1620 
aatactattg acgac 1665 



The encoded x subunit has an amino acid sequence corresponding to SEQ. ID. No. 26 
as follows: 
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Met Tyr Gin Ala Leu Tyr Arg Lys Tyr Arg Ser Gin Thr Phe Asp Glu 
15 10 15 

Met Val Gly Gin Ser Val lie Ser Thr Thr Leu Lys Gin Ala Val Glu 
20 25 30 

Ser Gly Lys He Ser His Ala Tyr Leu Phe Ser Gly Pro Arg Gly Thr 
35 40 45 

Gly Lys Thr Ser Ala Ala Lys He Phe Ala Lys Ala Met Asn Cys Pro 
50 55 60 

Asn Gin Val Asp Gly Glu Pro Cys Asn Gin Cys Asp He Cys Arg Asp 
65 70 75 80 

He Thr Asn Gly Ser Leu Glu Asp Val He Glu He Asp Ala Ala Ser 
85 90 95 

Asn Asn Gly Val Asp Glu He Arg Asp He Arg Asp Lys Ser Thr Tyr 
100 105 110 

Ala Pro Ser Arg Ala Thr Tyr Lys Val Tyr He He Asp Glu Val His 
115 120 125 

Met Leu Ser Thr Gly Ala Phe Asn Ala Leu Leu Lys Thr Leu Glu Glu 
130 135 140 

Pro Thr Glu Asn Val Phe He Leu Ala Thr Thr Glu Leu His Lys He 
145 150 155 160 

Pro Ala Thr He Leu Ser Arg Val Gin Arg Phe Glu Phe Lys Ala He 
165 170 175 

Lys Gin Lys Ala He Arg Glu His Leu Ala Trp Val Leu Asp Lys Glu 
180 185 190 

Gly He Ala Tyr Glu Val Asp Ala Leu Asn Leu He Ala Arg Arg Ala 
195 200 205 

Glu Gly Gly Met Arg Asp Ala Leu Ser He Leu Asp Gin Ala Leu Ser 
210 215 220 

Leu Ser Pro Asp Asn Gin Val Ala He Ala He Ala Glu Glu He Thr 
225 230 235 240 

Gly Ser He Ser He Leu Ala Leu Gly Asp Tyr Val Arg Tyr Val Ser 
245 250 255 

Gin Glu Gin Ala Thr Gin Ala Leu Ala Ala Leu Glu Thr He Tyr Asp 
260 265 270 

Ser Gly Lys Ser Met Ser Arg Phe Ala Thr Asp Leu Leu Thr Tyr Leu 
275 280 285 

Arg Asp Leu Leu Val Val Lys Ala Gly Gly Asp Asn Gin Arg Gin Ser 
290 295 300 

Ala Val Phe Asp Thr Asn Leu Ser Leu Ser He Asp Arg He Phe Gin 
305 310 315 320 

Met He Thr Val Val Thr Ser His Leu Pro Glu He Lys Lys Gly Thr 
325 330 335 
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His Pro Arg He Tyr Ala Glu Met Met Thr He Gin Leu Ala Gin Lys 

345 350 

Glu Gin He Leu Ser Gin Val Asn Leu Ser Gly Glu Leu He Sex Glu 
355 360 

He Glu Thr Leu Lys Asn Glu Leu Ala Gin Leu Lys Gin Gin Leu Ser 
370 375 380 

Gin Leu Gin Ser Arg Pro Asp Ser Leu Ala Arg Ser Asp Lys Thr Lys 

395 400 

Pro Lys Thr Thr Ser Tyr Arg Val Asp Arg Val Thr He Leu Lys He 
405 410 415 

Met Glu Glu Thr Val Arg Asn Ser Gin Gin Ser Arg Gin Tyr Leu Asp 
420 425 430 

Ala Leu Lys Asn Ala Trp Asn Glu He Leu Asp Asn He Ser Ala Gin 
435 440 

Asp Arg Ala Leu Leu Met Gly Ser Glu Pro Val Leu Ala Asn Ser Glu 
450 455 

Asn Ala He Leu Ala Phe Glu Ala Ala Phe Asn Ala Glu Gin Val Met 

475 480 

Ser Arg Asn Asn Leu Asn Asp Met Phe Gly Asn He Met Ser Lys Ala 
485 490 

Ala Gly Phe Ser Pro Asn He Leu Ala Val Pro Arg Thr Asp Phe Gin 
500 505 510 

His He Arg Lys Glu Phe Ala Gin Gin Met Lys Ser Gin Lys Asp Ser 
515 520 525 

Val Gin Glu Glu Gin Glu Val Ala Leu Asp He Pro Glu Gly Phe Asp 

535 540 

Phe Leu Leu Asp Lys He Asn Thr He Asp Asp 
545 550 555 

The present invention also relates to the dnaNgmc of Streptococcus 
pyogenes encoding the p subunit. The dnaN gene has a nucleotide sequence which 
corresponds to SEQ. ID. No. 27 as follows: 



atgattcaat tttcaattaa tcgcacatta 
gctattagca ctaaaaatgc cattcctatt 
acaggagtaa ctttaacagg gtctaacggt 
agtaatgaaa atgctggttt gctaattacc 
ttttttatta atattatttc aagtttgcca 
caacaccaag tcgttttaac cagtggtaaa 
gaccagtatc ctcgtctaca agaagtatca 
ttattgaagt ctattattgc tgaaacagct 
attttaacag gagttcatat tgtattaagt 
gactctcatc gtatgagcca acgtttaatc 
gtagttcttc caagtaaatc tttgagagaa 
accgttgagg tatttttctc accaagccaa 



tttattcatg ctttaaatac aactaaacgt 60 
ctttcatcaa taaaaattga agtcacttct 120 
caaatatcaa ttgaaaacac tattcctgta 180 
tctccaggag ctattttatt agaagctagt 24 0 
gatattagta taaatgttaa agaaattgaa 300 
tcagagatta ccttaaaagg aaaagatgtt 360 
acagaaaatc ctttgatttt aaaaacaaaa 420 
tttgcagcca gtttacaaga aagtcgtcct 480 
aatcataaag attttaaagc agtagcgact 54 0 
actttggaca atacttcagc agatttgatg 500 
ttttcagcag tatttacaga tgatattgag 660 
atcttgttca gaagtgaaca catttctttt 720 
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tatacacgcc tcttagaagg aaattatccc gatacagacc gtttattaat gacagaattt 780 
gagacggagg ttgttttcaa tacccaatcc cttcgccacg ctatggaacg tgccttcttg 840 
atttctaatg ctactcaaaa tggtactgtt aagcttgaga ttactcaaaa tcatatttca 900 
gctcatgtta actcacctga ggttggtaag gtaaacgagg atttagatat tgttagtcag 960 
tctggtagtg atttaactat cagcttcaat ccaacttacc ttattgagtc tttaaaagct 1020 
attaaaagtg aaacagtaaa aattcatttc ttatcaccag ttcgaccatt caccctaaca 1080 
ccaggcgatg aggaagaaag ttttatccaa ttaattacac cagtacgaac aaac 1134 

The encoded p subunit has an amino acid sequence corresponding to SEQ. ID. No. 28 
as follows: 



Met He Gin Phe Ser He Asn Arg Thr Leu- Phe He His Ala Leu Asn 
^5 10 15 

Thr Thr Lys Arg Ala He Ser Thr Lys Asn Ala He Pro He Leu Ser 
20 25 30 

Ser He Lys He Glu Val Thr Ser Thr Gly Val Thr Leu Thr Gly Ser 
35 40 45 

Asn Gly Gin He Ser He Glu Asn Thr He Pro Val Ser Asn Glu Asn 
50 55 60 

Ala Gly Leu Leu He Thr Ser Pro Gly Ala He Leu Leu Glu Ala Ser 

70 75 80 

Phe Phe He Asn He He Ser Ser Leu Pro Asp He Ser He Asn Val 
85 90 95 

Lys Glu He Glu Gin His Gin Val Val Leu Thr Ser Gly Lys Ser Glu 
100 105 110 

He Thr Leu Lys Gly Lys Asp Val Asp Gin Tyr Pro Arg Leu Gin Glu 
115 120 125 

Val Ser Thr Glu Asn Pro Leu He Leu Lys Thr Lys Leu Leu Lys Ser 
130 135 140 

He He Ala Glu Thr Ala Phe Ala Ala Ser Leu Gin Glu Ser Arg Pro 
150 155 160 

He Leu Thr Gly Val His He Val Leu Ser Asn His Lys Asp Phe Lys 
165 170 175 

Ala Val Ala Thr Asp Ser His Arg Met Ser Gin Arg Leu He Thr Leu 
180 185 190 

Asp Asn Thr Ser Ala Asp Leu Met Val Val Leu Pro Ser Lys Ser Leu 
195 200 205 

Arg Glu Phe Ser Ala Val Phe Thr Asp Asp He Glu Thr Val Glu Val 
210 215 220 

Phe Phe Ser Pro Ser Gin He Leu Phe Arg Ser Glu His He Ser Phe 
230 235 240 

Tyr Thr Arg Leu Leu Glu Gly Asn Tyr Pro Asp Thr Asp Arg Leu Leu 
245 250 255 

Met Thr Glu Phe Glu Thr Glu Val Val Phe Asn Thr Gin Ser Leu Arg 
260 265 270 
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His Ala Met Glu Arg Ala 
275 

Thr Val Lys Leu Glu lie 
290 

Ser Pro Glu Val Gly Lys 
305 310 

Ser Gly Ser Asp Leu Thr 
325 

Ser Leu Lys Ala lie Lys 
340 

Pro Val Arg Pro Phe Thr 
355 

He Gin Leu He Thr Pro 
370 



Phe Leu He Ser Asn 
280 

Thr Gin Asn His He 
295 

Val Asn Glu Asp Leu 
315 

He Ser Phe Asn Pro 
330 

Ser Glu Thr Val Lys 
345 

Leu Thr Pro Gly Asp 
360 

Val Arg Thr Asn 
375 



Ala Thr Gin Asn Gly 
285 

Ser Ala His Val Asn 
300 

Asp He Val Ser Gin 
320 

Thr Tyr Leu He Glu 
335 

He His Phe Leu Ser 
350 

Glu Glu Glu Ser Phe 
365 



The present invention also relates to the ssb gene of Streptococcus 
pyogenes encoding the single strand-binding protein (SSB). The ssb gene has a 
nucleotide sequence which corresponds to SEQ. ID. No. 29 as follows: 



atgattaata atgtagtact agttggtcgc atgaccaagg atgcagaact tcgttacaca 60 
ccaagtcaag tagctgtggc taccttcaca cttgctgtta accgtacctt taaaagccaa 120 
aatggtgaac gcgaggcaga tttcattaac tgtgtgatct ggcgtcaacc ggctgaaaat 180 
ttagcgaact gggctaaaaa aggtgctttg atcggagtta cgggtcgtat tcatacacgt 24 0 
aactacgaaa accaacaagg acaacgtgtc tatgtaacag aagttgttgc agataatttc 300 
caaatgttgg aaagtcgtgc tacacgtgaa ggtggctcaa ctggctcatt taatggtggt 3 60 
tttaacaata acacttcatc atcaaacagt tactcagcgc ctgcacaaca aacgcctaac 420 
tttggaagag atgatagccc atttgggaac tcaaacccga tggatatctc agatgacgat 480 
cttccattct ag 



The encoded SSB protein has an amino acid sequence corresponding to SEQ. ID. 
No. 30 as follows: 



Met He Asn Asn Val Val Leu Val Gly Arg Met Thr Lys Asp Ala Glu 
1 5 10 15 

Leu Arg Tyr Thr Pro Ser Gin Val Ala Val Ala Thr Phe Thr Leu Ala 
20 25 30 

Val Asn Arg Thr Phe Lys Ser Gin Asn Gly Glu Arg Glu Ala Asp Phe 
35 40 45 

He Asn Cys Val He Trp Arg Gin Pro Ala Glu Asn Leu Ala Asn Trp 
50 55 60 

Ala Lys Lys Gly Ala Leu He Gly Val Thr Gly Arg He Gin Thr Arg 
^5 70 75 80 

Asn Tyr Glu Asn Gin Gin Gly Gin Arg Val Tyr Val Thr Glu Val Val 
85 90 95 
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Ala Asp Asn Phe Gin Met Leu Glu Ser Arg Ala Thr Arg Glu Gly Gly 
100 105 110 

5 Ser Thr Gly Ser Phe Asn Gly Gly Phe Asn Asn Asn Thr Ser Ser Ser 

115 120 125 

Asn Ser Tyr Ser Ala Pro Ala Gin Gin Thr Pro Asn Phe Gly Arg Asp 
130 135 140 

10 

Asp Ser Pro Phe Gly Asn Ser Asn Pro Met Asp lie Ser Asp Asp Asp 
145 150 155 160 

Leu Pro Phe 

15 



The present invention also relates to the dnaG gene of Streptococcus 
pyogenes encoding the primase. The dnaG gene has a nucleotide sequence which 
corresponds to SEQ. ID. No. 31 as follows: 

20 

acgggatttt tatggggagg tgacgatttg gcaattgaca aagaaatgat ttcccaagta 60 
aaaaatagcg ttaatattgt cgatgtcatt ggagaagtgg tcaaactttc ccgatcaggg 120 
cggcattacc tcgggctttg cccatttcat aaggaaaaga caccctcttt taatgttgtt 180 
gaagacagac aattttttca ctgcttCggc tgtggaaaat caggggatgt ttttaaattt 240 

25 attgaggaat accgccaagt ccccttctta gaaagtgttc agattattgc cgataagact 300 

ggtatgtcgc ttaatatacc gccaagtcag gcagtacttg ctagccaaca caagcaccct 360 
aatcacgctt tgatgacact tcatgaggat gctgctaaat tttaccatgc agttttgatg 420 
accactacca ttggtcaaga agctaggaag tacctttacc agagaggctt ggatgaccaa 4 80 
ttaattgagc atttcaatat tggtttagcc ccagatgagt cagattatct ttatcaagct 540 

30 ctttctaaaa aatacgagga aggtcaattg gttgcttcag gattgtttca cttgtccgat 600 

caatccaata ccatttacga cgcctttcga aatcgtatca tgtttccctt atcagatgac 660 
cgagggcata ttattgcctt ttcaggacgt atctggacgg cagctgatat ggaaaagaga 720 
caggcaaagt ataaaaattc aagaggaaca gttcttttta acaaatctta tgaattgtat 780 
catctggaca aggcaaggcc tgttattgcc aaaacccatg aagtgtttct aatggaaggg 840 

35 tttatggacg tgattgccgc ttaccgttcc ggctatgaaa atgctgttgc ttcaatgggg 900 

acggcattga ctcaagaaca tgtcaatcac cttaagcaag tcactaaaaa agttgttttg 960 
atttatgatg gtgacgatgc tggacaacat gctattgcaa aatcactaga attgcttaaa 1020 
gattttgttg tcgaaattgt cagaatcccc aataaaatgg atcctgacga atttgtacaa 1080 
cggcattccc cagaagcatt tgcagatttg cttaagcagt cacggatcag tagtgttgaa 1140 

40 ttttttattg attacctaaa acctactaat gtagacaatt tgcaatcaca aattgtttat 1200 

gtggagaaaa tggcaccatt gattgctcaa tcaccatcca tcacagctca acattcgtat 1260 
attaacaaga ttgctgattt gttgccaaac tttgactatt ttcaagtaga acaatcagta 1320 
aatgcattaa ggattcaaga taggcaaaaa catcaaggtc aaatagctca agccgtcagc 1380 
aatcttgtga ccttaccaat gccaaaaagt ttgacagcta ttgctaagac agaaagtcat 1440 

45 ctcatgcatc ggctcttaca tcatgactat ttattaaatg aatttcgaca tcgtgatgat 1500 

ttttattttg atacctctac cttagaatta ctttatcaac ggctgaagca acaaggacac 1560 
attacatctt atgatttgtc agagatgtca gaggaagtta accgtgctta ttacaatgtt 1620. 
ttagaagaaa accttcccaa agaagtagct cttggtgaga ttgatgatat tttatccaaa 1680 
cgtgccaaac ttttagcaga gcgcgatctt cacaaacaag ggaaaaaagt tagagaatct 1740 

50 agtaacaaag gcgatcatca agcggctcta gaagtactag aacattttat tgcgcagaaa 1800 

cgaaaaatgg aatag 1815 

The encoded primase has an amino acid sequence corresponding to SEQ. ID. No. 32 
as follows: 

55 
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Met Gly Phe Leu 
1 

He Ser Gin Val 
20 

Val Val Lys Leu 

35 

Phe His Lys Glu 
50 

Phe Phe His Cys 
65 

He Glu Glu Tyr 



Ala Asp Lys Thr 
100 

Leu Ala Ser Gin 
115 

Glu Asp Ala Ala 
130 

Gly Gin Glu Ala 
145 

Leu He Glu His 



Leu Tyr Gin Ala 
180 

Ser Gly Leu Phe 
195 

Phe Arg Asn Arg 
210 

He Ala Phe Ser 
225 

Gin Ala Lys Tyr 



Tyr Glu Leu Tyr 
260 

His Glu Val Phe 
275 

Arg Ser Gly Tyr 
290 

Gin Glu His Val 
305 

He Tyr Asp Gly 



Trp Gly Gly Asp 
5 



Lys Asn Ser Val 



Ser Arg Ser Gly 
40 

Lys Thr Pro Ser 
55 

Phe Gly Cys Gly 

70 

Arg Gin Val Pro 
85 

Gly Met Ser Leu 



His Lys His Pro 
120 

Lys Phe Tyr His 
135 



Arg Lys Tyr Leu 
150 

Phe Asn He Gly 
165 



Leu Ser Lys Lys 



His Leu Ser Asp 
200 

He Met Phe Pro 
215 

Gly Arg He Trp 
230 



Lys Asn Ser Arg 
245 



His Leu Asp Lys 



Leu Met Glu Gly 
280 

Glu Asn Ala Val 
295 

Asn His Leu Lys 
310 



Asp Asp Ala Gly 
325 



Asp Leu Ala He 
10 



Asn He Val Asp 
25 



Arg His Tyr Leu 



Phe Asn Val Val 
60 

Lys Ser Gly Asp 
75 

Phe Leu Glu Ser 
90 

Asn He Pro Pro 
105 

Asn His Ala Leu 



Ala Val Leu Met 
140 

Tyr Gin Arg Gly 
155 

Leu Ala Pro Asp 
170 



Tyr Glu Glu Gly 
185 



Gin Ser Asn Thr 



Leu Ser Asp Asp 
220 

Thr Ala Ala Asp 
235 

Gly Thr Val Leu 
250 

Ala Arg Pro Val 
265 



Phe Met Asp Val 



Ala Ser Met Gly 
300 

Gin Val Thr Lys 
315 

Gin His Ala He 
330 



Asp Lys Glu Met 
15 

Val He Gly Glu 
30 

Gly Leu Cys Pro 
45 

Glu Asp Arg Gin 



Val Phe Lys Phe 
80 

Val Gin He He 
95 

Ser Gin Ala Val 
110 



Met Thr Leu His 
125 

Thr Thr Thr He 



Leu Asp Asp Gin 
160 

Glu Ser Asp Tyr 
175 

Gin Leu Val Ala 
190 

He Tyr Asp Ala 
205 



Arg Gly His He 



Met Glu Lys Arg 
240 

Phe Asn Lys Ser 
255 

He Ala Lys Thr 
270 

He Ala Ala Tyr 
285 



Thr Ala Leu Thr 



Lys Val Val Leu 
320 

Ala Lys Ser Leu 
335 
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Glu Leu Leu Lys Asp Phe Val Val Glu lie Val Arg lie Pro Asn Lys 
340 345 350 

Met Asp Pro Asp Glu Phe Val Gin Arg His Ser Pro Glu Ala Phe Ala 
355 360 365 

Asp Leu Leu Lys Gin Ser Arg He Ser Ser Val Glu Phe Phe He Asp 
370 375 380 

Tyr Leu Lys Pro Thr Asn Val Asp Asn Leu Gin Ser Gin He Val Tyr 
385 390 395 400 

Val Glu Lys Met Ala Pro Leu He Ala Gin Ser Pro Ser He Thr Ala 
405 410 415 

Gin His Ser Tyr He Asn Lys He Ala Asp Leu Leu Pro Asn Phe Asp 
420 425 430 

Tyr Phe Gin Val Glu Gin Ser Val Asn Ala Leu Arg He Gin Asp Arg 
435 440 445 

Gin Lys His Gin Gly Gin He Ala Gin Ala Val Ser Asn Leu Val Thr 
450 455 460 

Leu Pro Met Pro Lys Ser Leu Thr Ala He Ala Lys Thr Glu Ser His 
465 470 475 480 

Leu Met His Arg Leu Leu His His Asp Tyr Leu Leu Asn Glu Phe Arg 
485 490 495 

His Arg Asp Asp Phe Tyr Phe Asp Thr Ser Thr Leu Glu Leu Leu Tyr 
500 505 510 

Gin Arg Leu Lys Gin Gin Gly His He Thr Ser Tyr Asp Leu Ser Glu 
515 520 525 

Met Ser Glu Glu Val Asn Arg Ala Tyr Tyr Asn Val Leu Glu Glu Asn 
530 535 540 

Leu Pro Lys Glu Val Ala Leu Gly Glu He Asp Asp He Leu Ser Lys 
545 550 555 560 

Arg Ala Lys Leu Leu Ala Glu Arg Asp Leu His Lys Gin Gly Lys Lys 
565 570 575 

Val Arg Glu Ser Ser Asn Lys Gly Asp His Gin Ala Ala Leu Glu Val 
580 585 590 

Leu Glu His Phe He Ala Gin Lys 
595 600 



The present invention also relates to the dnaB gene of Streptococcus 
pyogenes encoding DnaB. The dnaB gene has a nucleotide sequence which 
corresponds to SEQ. ID. No. 33 as follows: 



atgaggttgc ctgaagtagc tgaattacga 
tctgttcttg ggtcaatctt tatctcacct 
agtccagacg atttttataa gtacgctcat 
agcgatcgta atgatgccat tgatgcaacc 



gttcaacccc aagatttact agcagagcaa 60 
gataagctga ttgcagtgag agaatttatc 120 
aaaattatct ttcgggcaat gattaccctc 180 
actataagaa caatcctaga tgatcaagat 24 0 
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gatctgcaaa gtattggtgg cttatcctat attgttgaac tagttaatag tgtcccaact 300 

agtgctaatg cagaatatta tgctaaaatt gtagctgaga aagctatgtt gcgtgatatt 360 

attgctaggt tgacagaatc tgtcaaccta gcttatgatg aaattttaaa accagaagag 420 

gttatcgctg gagttgagag agctttaatt gaactcaatg aacatagtaa ccgtagtggg 480 

5 tttcgcaaaa tttcagatgt gctaaaagtt aattacgagg ctttagaagc acgttctaag 540 

cagacttcaa atgttacagg tttaccaact ggttttagag accttgacaa gattacaaca 600 

ggtttacacc cagatcaatt agttatttta gctgctcggc cagcagtggg gaagactgcc 660 

tttgttctta atattgcgca aaatgtgggg actaagcaaa aaaagactgt tgctattttt 720 

tctttggaaa tgggtgctga aagtttagta gatcgtatgc ttgcagcaga aggaatggtt 780 

10 gattcgcaca gtttaagaac agggcaactc acagatcagg attggaataa tgtaacaatt 840 

gctcagggag ctttggcaga agcaccgatt tatattgacg atacgcccgg gattaaaatt 900 

actgaaatcc gcgcaagatc acggaaattg tctcaagaag tggatggtgg tttaggtctc 960 

attgtaattg actacttaca gttgattaca ggaactaaac ccgaaaatcg tcagcaagag 1020 

gtttcagata tttcaagaca gcttaaaatc ctagctaaag aattgaaagt accagttatt 1080 

15 gccctaagtc agctttctcg tggcgttgag caaaggcaag ataaacgacc agttttatca 1140 

gatattcgtg aatcaggatc cactgagcag gatgccgata ttgtagcctt cttataccgg 1200 

gacgattatt accgtaaaga atgtgatgat gctgaagaag ctgttgaaga taacacaatt 1260 

gaagttatcc tcgagaaaaa tagagctggg gcgcgtggaa cagtcaaact gatgttccaa 1320 

aaagaataca acaaattctc aagtatagcc cagtttgaag aaagataa 1368 



20 



The encoded DnaB has an amino acid sequence corresponding to SEQ. ID. No. 34 as 
follows: 



Met Arg Leu Pro Glu Val Ala Glu Leu Arg Val Gin Pro Gin Asp Leu 

25 1 5 10 15 

Leu Ala Glu Gin Ser Val Leu Gly Ser lie Phe lie Ser Pro Asp Lys 

20 25 30 

30 Leu lie Ala Val Arg Glu Phe lie Ser Pro Asp Asp Phe Tyr Lys Tyr 

35 40 45 



35 



50 



Ala His Lys lie lie Phe Arg Ala Met lie Thr Leu Ser Asp Arg Asn 
50 55 60 

Asp Ala lie Asp Ala Thr Thr lie Arg Thr lie Leu Asp Asp Gin Asp 

65 70 75 80 



Asp Leu Gin Ser lie Gly Gly Leu Ser Tyr lie Val Glu Leu Val Asn 
40 85 90 95 

Ser Val Pro Thr Ser Ala Asn Ala Glu Tyr Tyr Ala Lys lie Val Ala 
100 105 110 

45 Glu Lys Ala Met Leu Arg Asp He He Ala Arg Leu Thr Glu Ser Val 

115 120 125 



Asn Leu Ala Tyr Asp Glu He Leu Lys Pro Glu Glu Val He Ala Gly 
130 135 140 

Val Glu Arg Ala Gin Gly Ala Leu Ala Glu Ala Pro He Tyr He Asp 

145 150 155 160 



Asp Thr Pro Gly He Lys He Ala Leu He Glu Leu Asn Glu His Ser 
55 165 170 175 

Asn Arg Ser Gly Phe Arg Lys He Ser Asp Val Leu Lys Val Asn Tyr 
180 185 190 

60 Glu Ala Leu Glu Ala Arg Ser Lys Gin Thr Ser Asn Val Thr Gly Leu 

195 200 205 
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10 



20 



25 



35 



40 



45 



Pro Thr Gly Phe Arg Asp Leu Asp Lys He Thr Thr Gly Leu His Pro 
210 215 220 

Asp Gin Leu Val He Leu Ala Ala Arg Pro Ala Val Gly Lys Thr Ala 
225 230 235 240 

Phe Val Leu Asn He Ala Gin Asn Val Gly Thr Lys Gin Lys Lys Thr 
245 250 255 

Val Ala He Phe Ser Leu Glu Met Gly Ala Glu Ser Leu Val Asp Arg 
260 265 270 



Met Leu Ala Ala Glu Gly Met Val Asp Ser His Ser Leu Arg Thr Gly 
275 280 285 



Gin Leu Thr Asp Gin Asp Trp Asii Asn Val Thr He Thr Glu He Arg 
290 295 300 

Ala Arg Ser Arg Lys Leu Ser Gin Glu Val Asp Gly Gly Leu Gly Leu 
305 310 315 320 

He Val He Asp Tyr Leu Gin Leu He Thr Gly Thr Lys Pro Glu Asn 
325 330 335 

Arg Gin Gin Glu Val Ser Asp He Ser Arg Gin Leu Lys He Leu Ala 
340 345 350 



Lys Glu Leu Lys Val Pro Val He Ala Leu Ser Gin Leu Ser Arg Gly 
30 355 360 365 



Val Glu Gin Arg Gin Asp Lys Arg Pro Val Leu Ser Asp He Arg Glu 
370 375 380 

Ser Gly Ser He Glu Gin Asp Ala Asp He Val Ala Phe Leu Tyr Arg 
385 390 395 400 

Asp Asp Tyr Tyr Arg Lys Glu Cys Asp Asp Ala Glu Glu Ala Val Glu 
405 410 415 

Asp Asn Thr He Glu Val He Leu Glu Lys Asn Arg Ala Gly Ala Arg 
420 425 430 

Gly Thr Val Lys Leu Met Phe Gin Lys Glu Tyr Asn Lys Phe Ser Ser 
435 440 445 

He Ala Gin Phe Glu Glu Arg 
450 455 



50 Fragments of the above polypeptides or proteins are also encompassed 

by the present invention. 

Suitable fragments can be produced by several means. In the first, 
subclones of the gene encoding the protein of the present invention are produced by 
conventional molecular genetic manipulation by subcloning gene fragments. The 

55 subclones then are expressed in vitro or in vivo in bacterial cells to yield a smaller 
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protein or peptide that can be tested for activity according to the procedures described 
below. 

As an alternative, fragments of replication proteins can be produced by 
digestion of a full-length replication protein with proteolytic enzymes like 
chymotrypsin or Staphylococcus proteinase A, or trypsin. Different proteolytic 
enzymes are likely to cleave replication proteins at different sites based on the amino 
acid sequence of the protein. Some of the fragments that result from proteolysis may 
be active and can be tested for activity as described below. 

In another approach, based on knowledge of the primary structure of 
the protein, fragments of a replication protein gene may be synthesized by using the 
PGR technique together with specific sets of primers chosen to represent particular 
portions of the protein. These then would be cloned into an appropriate vector for 
increased expression of a truncated peptide or protein. 

Chemical synthesis can also be used to make suitable fragments. Such 
a synthesis is carried out using known amino acid sequences of replication proteins 
being produced. Alternatively, subjecting a full length replication protein to high 
temperatures and pressures will produce fragments. These fragments can then be 
separated by conventional procedures (e.g., chromatography, SDS-PAGE). 

Variants may also (or alternatively) be modified by, for example, the 
deletion or addition of amino acids that have minimal influence on the properties, 
secondary structure, and hydropathic nature of the polypeptide. For example, a 
polypeptide may be conjugated to a signal (or leader) sequence at the N-terminal end 
of the protein which cotranslationally or post-translationally directs transfer of the 
protein. The polypeptide may also be conjugated to a linker or other sequence for 
ease of synthesis, purification, or identification of the polypeptide. 

Suitable DNA molecules are those that hybridize to a DNA molecule 
comprising a nucleotide sequence of at least about 20, more preferably at least about 
30 to about 50, continuous bases of either SEQ. ID. Nos. 1, 3, 5, 7, 9, 11,13, 15, 17, 
19, 21, 23, 25, 27, 29, 31, or 33 under stringent conditions such as those characterized 
by a hybridization buffer comprising 0.9M sodium citrate ("SSC") buffer at a 
temperature of about 37°C and remaining bound when subject to washing the SSC 
buffer at a temperature of about 37T; and preferably in a hybridization buffer 
comprising 20% formamide in 0.9M SSC buffer at a temperature of about 42*»C and 
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remaining bound when subject to washing at about 42°C with 0.2x SSC buffer. 
Stringency conditions can be further varied by modifying the temperature and/or salt 
content of the buffer, or by modifying the length of the hybridization probe. 

The proteins or polypeptides of the present invention are preferably 
5 produced in purified form (preferably at least 80%, more preferably 90%, pure) by 
conventional techniques. Typically, the proteins or polypeptides of the present 
invention is secreted into the growth medium of recombinant host cells. 
Alternatively, the proteins or polypeptides of the present invention are produced but 
not secreted into growth medium. In such cases, to isolate the protein, the host cell 

10 (e.g., £. coli) carrying a recombinant plasmid is propagated, lysed by sonication, heat, 
or chemical treatment, and the homogenate is centrifuged to remove bacterial debris. 
The supernatant is then subjected to purification procedures such as ammonium 
sulfate precipitation, gel filtration, ion exchange chromatography, FPLC, and HPLC. 

The DNA molecule encoding rephcation polypeptides or proteins 

15 derived from Gram positive bacteria can be incorporated in cells using conventional 

recombinant DNA technology. Generally, this involved inserting the DNA molecule 
into an expression system to which the DNA molecule is heterologous (i.e. not 
normally present). The heterologous DNA molecule is inserted into the expression 
system or vector in proper sense orientation and correct reading frame. The vector 

20 contains the necessary elements for the transcription and translation of the inserted 

protein-coding sequences. 

U.S. Patent No. 4,237,224 to Cohen and Boyer, which is hereby 
incorporated by reference, describes the production of expression systems in the form 
of recombinant plasmids using restriction enzyme cleavage and ligation with DNA 

25 ligase. These recombinant plasmids are then introduced by means of transformation 
and replicated in unicellular cultures including procaryotic organisms and eucaryotic 
cells grown in tissue culture. 

Recombinant genes may also be introduced into viruses, such as 
vaccina virus. Recombinant viruses can be generated by transfection of plasmids into 

30 cells infected with virus. 

Suitable vectors include, but are not limited to, the following viral 
vectors such as lambda vector system gtll, gt WES.tB, Charon 4, and plasmid vectors 
such as pBR322, pBR325, pACYC177, pACYC1084, pUC8, pUC9, pUC18, pUC19, 
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pLG339, pR290, pKC37, pKClOl, SV 40, pBluescript n SK +/- or KS +/- (see 
"Stratagene Cloning Systems" Catalog (1993) from Stratagene, La Jolla, Calif, which 
is hereby incorporated by reference), pQE, pIH821, pGEX, pET series (see 
F.W. Studier et al., 'TJse of T7 RNA Polymerase to Direct Expression of Cloned 
Genes," Gene Expression Technology vol. 185 (1990), which is hereby incorporated 
by reference), and any derivatives thereof Recombinant molecules can be introduced 
into cells via transformation, particularly transduction, conjugation, mobilization, or 
electroporation. The DNA sequences are cloned into the vector using standard 
cloning procedures in the art, as described by Sambrook et ah. Molecular Cloning: A 
Laboratory Manual . Cold Springs Laboratory, Cold Springs Harbor, New York 
(1989), which is hereby incorporated by reference. 

A variety of host-vector systems may be utilized to express the protein- 
encoding sequence(s). Primarily, the vector system must be compatible with the host 
cell used. Host-vector systems include but are not limited to the following: bacteria 
transformed with bacteriophage DNA, plasmid DNA, or cosmid DNA; 
microorganisms such as yeast containing yeast vectors; mammalian cell systems 
infected with virus (e.g., vaccinia virus, adenovirus, etc.); insect cell systems infected 
with virus (e.g., baculovirus); and plant cells infected by bacteria. The expression 
elements of these vectors vary in their strength and specificities. Depending upon the 
host- vector system utilized, any one of a number of suitable transcription and 
translation elements can be used. 

Different genetic signals and processing events control many levels of 
gene expression (e.g., DNA transcription and messenger RNA (mRNA) translation). 

Transcription of DNA is dependent upon the presence of a promotor 
which is a DNA sequence that directs the binding of RNA polymerase and thereby 
promotes mRNA synthesis. The DNA sequences of eucaryotic promoters differ from 
those of procaryotic promoters. Furthermore, eucaryotic promoters and accompanying 
genetic signals may not be recognized in or may not fimction in a procaryotic system, 
and, further procaryotic promoters are not recognized and do not function in 
eucaryotic cells. 

Similarly, translation of mRNA in procaryotes depends upon the 
presence of the proper procaryotic signals which differ from those of eukaryotes. 
Efficient translation of mRNA in procaryotes requires a ribosome binding site called 
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the Shine-Dalgamo ("SD") sequence on the mRNA. This sequence is a short 
nucleotide sequence of mRNA that is located before the same codon, usually AUG, 
which encodes the amino-terminal methionine of the protein. The SD sequences are 
complementary to the 3'-end of the 16S rRNA (ribosomal RNA) and probably 
5 promote binding of mRNA to ribosomes by duplexing with the rRNA to allow correct 
positioning of the ribosome. For a review on maximizing gene expression, see 
Roberts and Lauer, Methods in Enzymology , 68:473 (1979), which is hereby 
incorporated by reference. 

Promoters vary in their "strength" (i.e. their ability to promote 

10 transcription). For the purposes of expressing a cloned gene, it is desirable to use 

strong promoters in order to obtain a high level of transcription and, hence, expression 
of the gene. Depending upon the host cell system utilized, any one of a number of 
suitable promoters may be used. For instance, when cloning in E, coli, its 
bacteriophages, or plasmids, promoters such as the T7 phage promoter, lac promotor, 

15 (rp promotor, recA promotor, ribosomal RNA promotor, the Pr and Pl promoters of 

coliphage lambda and others, including but not limited, to /acUVS, ompF, bla, Ipp, 
and the like, may be used to direct high levels of transcription of adjacent DNA 
segments. Additionally, a hybrid trp-lacUVS (iac) promotor or other E, coli 
promoters produced by recombinant DNA or other synthetic DNA techniques may be 

20 used to provide for transcription of the inserted gene. 

Bacterial host cell strains and expression vectors may be chosen which 
inhibit the action of the promotor unless specifically induced. In certain operations, 
the addition of specific inducers is necessary for efficient transcription of the inserted 
DNA. For example, the lac operon is induced by the addition of lactose or IPTG 

25 (isopropylthio-beta-D-galactoside). A variety of other operons, such as trp, pro, etc., 
are under different controls. Additionally, the cell may carry the gene for a 
heterologous RNA polymerase such as fi-om phage T7. Thus, a promoter specific for 
T7 RNA polymerase is used. The T7 RNA polymerase may be under inducible 
control. 

30 Specific initiation signals are also required for efficient gene 

transcription and translation in procaryotic cells. These transcription and translation 
initiation signals may vary in "strength" as measured by the quantity of gene specific 
messenger RNA and protein synthesized, respectively. The DNA expression vector, 
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Which contains a promotor, may also contain any combination of various "strong" 
transcription and/or translation initiation signals. For instance, efficient translation in 
E. coll requires an SD sequence about 7-9 bases 5' to the initiation codon ("ATG") to 
provide a ribosome binding site. Thus, an SD-ATG combination that can be utilized 
by host cell ribosomes may be employed. Such combinations include but are not 
limited to the SD-ATG combination from the cro gene or the //gene of coliphage 
lambda, or from the E. coli tryptophan E, D, C, B or A genes. Additionally, any SD- 
ATG combination produced by recombinant DNA or other techniques involving 
incorporation of synthetic nucleotides may be used. 

Once the isolated DNA molecule encoding a replication polypeptide or 
protein has been cloned into an expression system, it is ready to be incorporated into a 
host cell. Such incorporation can be carried out by the various forms of 
transformation noted above, depending upon the vector/host cell system. Suitable host 
cells include, but are not limited to, bacteria, viruses, yeast, mammalian cells, insects, 
plants, and the like. 

The invention provides efficient methods of identifying 
pharmacological agents or lead compounds for agents active at the level of a 
replication protein function, particularly DNA replication. Generally, these screening 
methods involve assaying for compounds which interfere with the replication activity. 
The methods are amenable to automated, cost-effective high throughput screening of 
chemical libraries for lead compounds. Identified reagents find use in the 
pharmaceutical industries for animal and human trials; for example, the reagents may 
be derivatized and rescreened in in vitro and in vivo assays to opfimize activity and 
minimize toxicity for pharmaceutical development. Target therapeutic indications are 
limited only in that the target cellular fiinction be subject to modulation, usually 
inhibition, by disruption of a replication activity or the formation of a complex 
comprising a replication protein and one or more natural inti-acellular binding targets. 
Target indications may include arresting cell growth or causing cell death resulting in 
recovery from the bacterial infection in animal studies. 

A wide variety of assays for activity and binding agents are provided, 
including DNA synthesis. ATPase, clamp loading onto DNA, protein-protein binding 
assays, immunoassays, cell based assays, etc. The replication protein compositions, 
used to identify pharmacological agents, are in isolated, partially pure or pure form 
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and are typically recombinantly produced. The replication protein may be part of a 
fusion product with another peptide or polypeptide (e.g., a polypeptide that is capable 
of providing or enhancing protein-protein binding, stability under assay conditions 
(e.g., a tag for detection or anchoring), etc.). The assay mixtures comprise a natural 
intracellular replication protein binding target such as DNA, another protein, NTP, or 
dNTP. For binding assays, while native binding targets may be used, it is frequently 
preferred to use portions (e.g., peptides, nucleic acid fragments) thereof so long as the 
portion provides binding affinity and avidity to the subject replication protein 
conveniently measurable in the assay. The assay mixture also comprises a candidate 
pharmacological agent. Generally, a plurality of assay mixtures are run in parallel 
with different agent concentrations to obtain a differential response to the various 
concentrations. Typically, one of these concentrations serves as a negative control 
(i.e., at zero concentration or below the limits of assay detection). Additional controls 
are often present such as a positive control, a dose response curve, use of known 
inhibitors, use of control heterologous proteins, etc. Candidate agents encompass 
numerous chemical classes, though typically they are organic compounds; preferably 
they are small organic compounds and are obtained from a wide variety of sources, 
including libraries of synthetic or natural compounds. A variety of other reagents may 
also be included in the mixture. These include reagents like salts, buffers, neutral 
proteins (e.g., albumin, detergents, etc.), which may be used to facilitate optimal 
binding and/or reduce nonspecific or background interactions, etc. Also reagents that 
otherwise improve the efficiency of the assay (e.g., protease inhibitors, nuclease 
inhibitors, antimicrobial agents, etc.) may be used. 

The invention provides replication protein specific assays and the 
binding agents including natural intracellular binding targets such as other replication 
proteins, etc., and methods of identifying and making such agents, and their use in a 
variety of diagnostic and therapeutic applications, especially where disease is 
associated with excessive cell growth. Novel replication protein-specific binding 
agents include replication protein-specific antibodies and other natural intracellular 
binding agents identified with assays such as one- and two-hybrid screens, non-natural 
intracellular binding agents identified in screens of chemical libraries, etc. 

Generally, replication protein-specificity of the binding agent is showTi 
by binding equilibrium constants. Such agents are capable of selectively binding a 
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replication protein (i.e., with an equilibrium constant at least about 10^ M'\ 
preferably, at least about 10^ M'*, more preferably, at least about 10^ M''). A wide 
variety of cell-based and cell-free assays may be used to demonstrate replication 
protein-specific activity, binding, gel shift assays, immunoassays, etc. 
5 The resultant mixture is incubated under conditions whereby, but for 

the presence of the candidate pharmacological agent, the replication protein 
specifically binds the cellular binding target, portion, or analog. The mixture of 
components can be added in any order that provides for the requisite bindings. 
Incubations may be performed at any temperature which facilitates optimal binding, 

10 typically between 4°C and 40**C, more commonly between 1 S'^C and 40^*0. Incubation 

periods are likewise selected for optimal binding but also minimized to facilitate 
rapid, high-throughput screening, and are typically between 0.1 and 10 hours, 
preferably less than 5 hours, more preferably less than 2 hours. 

After incubation, the presence or absence of activity or specific binding 

15 between the replication protein and one or more binding targets is detected by any 

convenient way. For cell-free activity and binding type assays, a separation step may 
be used to separate the activity product or the bound from unbound components. 
Separation may be effected by precipitation (e.g., immunoprecipitation), 
iimnobilization (e.g., on a solid substrate such as a microtiter plate), etc., followed by 

20 washing. Many assays that do not require separation are also possible such as use of 
europium conjugation in proximity assays or a detection system that is dependent on a 
product or loss of substrate. 

Detection may be effected in any convenient way. For cell-free activity 
and binding assays, one of the components usually comprises or is coupled to a label. 

25 A wide variety of labels may be employed - essentially any label that provides for 
detection of DNA product, loss of DNA substrate, conversion of a nucleotide 
substrate, or bound protein is useful. The label may provide for direct detection such 
as radioactivity, fluorescence, luminescence, optical, or electron density, etc. or 
indirect detection such as an epitope tag, an enzyme, etc. The label may be appended 

30 to the protein (e.g., a phosphate group comprising a radioactive isotope of 

phosphorous), or incorporated into the DNA substrate or the protein structure (e.g., a 
methionine residue comprising a radioactive isotope of sulfiir.) A variety of methods 
may be used to detect the label depending on the nature of the label and other assay 
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components. For example, the label may be detected bound to the solid substrate, or a 
portion of the bound complex containing the label may be separated from the solid 
substrate, and thereafter the label detected. Labels may be directly detected through 
optical or electron density, radioactive emissions, nonradiative energy transfer, 
fluorescence emission, etc. or indirectly detected with antibody conjugates, etc. For 
example, in the case of radioactive labels, emissions may be detected directly (e.g., 
with particle counters) or indirectly (e.g., with scintillation cocktails and counters). 

The present invention identifies the set of proteins that together result 
in a three component polymerase from bacteria that are distantly related to E, coli, 
such as Gram positive bacteria. Specifically, these bacteria lack several genes that E. 
coli DNA polymerase m has, such as holD, holD or holE, Further, dnaX 'xs believed 
to encode only one protein, tau. Also, holA is quite divergent in homology suggesting 
it may function in another process in these organisms. Gram positive cells even have 
replication genes that E. coli does not, implying that they may not utilize the 
replication strategies exemplified hy E. coli. 

The present invention identifies genes and proteins that form a three 
component polymerase in Gram positive organisms, such as S. pyogenes and S. 
aureus. In S. pyogenes and S. aureus, the polymerase a-large, functions with a 
p clamp and a clamp loader component of t55'. They display high speed and 
processivity in synthesis of ssDNA coated with SSB and primed with a DNA 
oligonucleotide. 

This invention also expresses and purifies a protein from a Gram 
positive bacteria that is homologous to the E. coli beta subunit. The invention 
demonstrates that it behaves like a circular protein. Further, this invention shows that 
a beta subunit from a Gram positive bacteria is functional with both Pol ID-L (a-large) 
from a Gram positive bacteria and with DNA polymerase HI from a Gram negative 
bacteria. This result can be explained by an interaction between the clamp and the 
polymerase that has been conserved during the evolutionary divergence of Gram 
positive and Gram negative cells. A chemical inhibitor that would disrupt this 
interaction would be predicted to have a broad spectrum of antibiotic activity, shutting 
down replication in gram negative and gram positive cells alike. This assay, and 
others based on this interaction, can be devised to screen chemicals for such 
inhibition. Further, since all the proteins in this assay are highly overexpressed 
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through recombinant techniques, sufficient quantities of the protein reagents can be 
obtained for screening hundreds of thousands of compoimds. 

This invention also shows that the DnaE polymerase (a-small), 
encoded by the dnaE gene, functions with the beta clamp and t65' complex. The 
speed of DnaE is not significantly increased by t58' and p, but the processivity of 
DnaE is greatly increased by t66' and p. Hence, the DnaE polymerase, coupled with 
its (3 clamp on DNA (loaded by t65') may also be an important target for a candidate 
pharmaceutical drug. 

The present invention provides methods by which replication proteins 
from a Gram positive bacteria are used to discover new pharmaceutical agents. The 
function of replication proteins is quantified in the presence of different chemical 
compounds. A chemical compound that inhibits the function is a candidate antibiotic. 
Some replication proteins from a Gram positive bacteria and from a Gram negative 
bacteria can be interchanged for one another. Hence, they can function as mixtures. 
Reactions that assay for the function of enzyme mixtures consisting of proteins from 
Gram positive bacteria and from Gram negative bacteria can also be used to discover 
drugs. Suitable E, coli replication proteins are the subunits of its Pol III holoenzyme 
which are described in U.S. Patent Nos. 5,583,026 and 5,668,004 to O'Donnell, which 
are hereby incorporated by reference. 

The methods described herein to obtain genes, and the assays 
demonstrating activity behavior of S. pyogenes and S. aureus replication proteins are 
likely to generalize to all members of the Streptococcus and Staphylococcus genuses, 
as well as to all Gram positive bacteria. Such assays are also likely to generalize to 
other cells besides Gram positive bacteria which also share features in common with 
S, pyogenes and S. aureus that are different from E, coli (i.e., lacking holC, holD, or 
holE\ having a dnaXgtn^ encoding a single protein; or having a weak homology to 
hoIA encoding delta). 

The present invention describes a method of identifying compounds 
which inhibit the activity of a polymerase product ofpolC or dnaE. This method is 
carried out by forming a reaction mixture that includes a primed DNA molecule, a 
polymerase product of polC or dnaE, a candidate compound, a dNTP, and optionally 
either a beta subunit, a tau complex, or both the beta subunit and the tau complex, 
wherein at least one of the polymerase product of polC ox dnaE, the beta subunit, the 
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tau complex, or a subunit or combination of subimits thereof is derived from a 
Eubacteria other than Escherichia coli\ subjecting the reaction mixture to conditions 
effective to achieve nucleic acid polymerization in the absence of the candidate 
compound; analyzing the reaction mixture for the presence or absence of nucleic acid 
polymerization extension products; and identifying the candidate compound in the 
reaction mixture where there is an absence of nucleic acid polymerization extension 
products. Preferably, the polymerase product of polC or dnaE, the beta subunit, the 
tau complex, or the subunit or combination of subunits thereof is derived from a Gram 
positive bacterium, more preferably a Streptococcus bacterium such as S, pyogenes or 
a Staphylococcus bacterium such as S. aureus. 

The present invention describes a method to identify chemicals that 
inhibit the activity of the three component polymerase. This method involves 
contacting primed DNA with the DNA polymerase in the presence of the candidate 
pharmaceutical, and dNTPs (or modified dNTPs) to form a reaction mixture. The 
reaction mixture is subjected to conditions effective to achieve nucleic acid 
polymerization in the absence of the candidate pharmaceutical and the presence or 
absence of the extension product in the reaction mixture is analyzed. The candidate 
pharmaceutical is detected by the absence of product. 

The present invention describes a method to identify candidate 
pharmaceuticals that inhibit the activity of a clamp loader complex and a beta subunit 
in stimulating the DNA polymerase. The method includes contacting a primed DNA 
(which may be coated with SSB) with a DNA polymerase, a beta subunit, and a tau 
complex (or subunit or subassembly of the tau complex) in the presence of the 
candidate pharmaceutical, and dNTPs (or modified dNTPs) to form a reaction 
mixture. The reaction mixture is subjected to conditions which, in the absence of the 
candidate pharmaceutical, would effect nucleic acid polymerization and the presence 
or absence of the extension product in the reaction mixture is analyzed. The candidate 
pharmaceutical is delected by the absence of product. The DNA polymerase, the beta 
subunit, and/or the tau complex or subunit(s) thereof are derived from a Gram positive 
bacterium. 

The present invention describes a method to identify chemicals that 
inhibit the ability of a beta subunit and a DNA polymerase to interact physically. This 
method involves contacting the beta subunit with the DNA polymerase in the presence 
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Of the candidate phaimaceutical to form a reaction mixture. The reaction mixture is 
subjected to conditions under which the DNA polymerase and the beta subunit would 
interact in the absence of the candidate pharmaceutical. The reaction mixture is then 
analyzed for interaction between the beta unit and the DNA polymerase. The 
candidate pharmaceutical is detected by the absence of interaction between the beta 
subunit and the DNA polymerase. The DNA polymerase and/or the beta subunit are 
derived from a Gram positive bacterium. 

The present invention describes a method to identify chemicals that 
inhibit the ability of a beta subunit and a tau complex (or a subunit or subassembly of 
the tau complex) to interact. This method includes contacting the beta subunit with 
the tau complex (or subunit or subassembly of the tau complex) in the presence of the 
candidate pharmaceutical to form a reaction mixture. The reaction mixture is 
subjected to conditions under which the tau complex (or the subunit or subassembly 
of the tau complex) and the beta subunit would interact in the absence of the candidate 
pharmaceutical. The reaction mixture is then analyzed for interaction between the 
beta subunit and the tau complex (or the subunit or subassembly of the tau complex). 
The candidate phaimaceutical is detected by the absence of interaction between the 
beta subunit and the tau complex (or the subunit or subassembly of the tau complex) . 
The beta subunit and/or the tau complex or subunit thereof is derived from a Gram 
positive bacterium. 

The present invention describes a method to identify chemicals that 
inhibit the ability of a tau complex (or a subassembly of the tau complex) to assemble 
a beta subunit onto a DNA molecule. This method involves contacting a circular 
primed DNA molecule (which maybe coated with SSB) with the tau complex (or the 
subassembly thereof) and the beta subunit in the presence of the candidate 
pharmaceutical, and ATP or dATP to form a reaction mixture. The reaction mixture 
is subjected to conditions under which the tau complex (or subassembly) assembles 
the beta subunit on the DNA molecule absent the candidate pharmaceutical. The 
presence or absence of the beta subunit on the DNA molecule in the reaction mixture 
IS analyzed. The candidate pharmaceutical is detected by the absence of the beta 
subunit on the DNA molecule. The beta subunit and/or the tau complex are derived 
from a Gram positive bacterium. 
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The present invention describes a method to identify chemicals that 
inhibit the ability of a tau complex (or a subunit(s) of the tau complex) to disassemble 
a beta subunit from a DNA molecule. This method comprises contacting a DNA 
molecule onto which the beta subunit has been assembled in the presence of the 
candidate pharmaceutical, to form a reaction mixture. The reaction mixture is 
subjected to conditions under which the tau complex (or a subunit(s) or subassembly 
of the tau complex) disassembles the beta subunit from the DNA molecule absent the 
candidate pharmaceutical. The presence or absence of the beta subunit on the DNA 
molecule in the reaction mixture is analyzed. The candidate pharmaceutical is 
detected by the presence of the beta subunit on the DNA molecule. The beta subunit 
and/or the tau complex are derived from a Gram positive bacterium. 

The present invention describes a method to identify chemicals that 
disassemble a beta subunit from a DNA molecule. This method involves contacting a 
circular primed DNA molecule (which may be coated with SSB) upon which the beta 
subunit has been assembled (e.g. by action of the tau complex) with the candidate 
pharmaceutical. The presence or absence of the beta subunit on the DNA molecule in 
the reaction mixture is analyzed. The candidate pharmaceutical is detected by the 
absence of the beta subunit on the DNA molecule. The beta subunit is derived from a 
Gram positive bacterium. 

The present invention describes a method to identify chemicals that 
inhibit the dATP/ATP binding activity of a tau complex or a tau complex subunit (e.g. 
tau subunit). This method includes contacting the tau complex (or the tau complex 
subunit) with dATP/ATP either in the presence or absence of a DNA molecule and/or 
the beta subunit in the presence of the candidate pharmaceutical to form a reaction. 
The reaction mixture is subjected to conditions in which the tau complex (or the 
subunit of tau complex) interacts with dATP/ATP in the absence of the candidate 
pharmaceutical. The reaction is analyzed to determine if dATP/ATP is bound to the 
tau complex (or the subunit of tau complex) in the presence of the candidate 
pharmaceutical. The candidate pharmaceutical is detected by the absence of 
hydrolysis. The tau complex and/or the beta subunit is derived from a Gram positive 
bacterium. 

The present invention describes a method to identify chemicals that 
inhibit the dATP/ATPase activity of a tau complex or a tau complex subunit (e.g., the 
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tau subunit). This method involves contacting the tau complex (or the tau complex 
subunit) with dATP/ATP either in the presence or absence of a DNA molecule and/or 
a beta subunit in the presence of the candidate pharmaceutical to form a reaction 
mixture. The reaction mixture is subjected to conditions in which the tau subunit (or 
complex) hydrolyzes dATP/ATP in the absence of the candidate pharmaceutical. The 
reaction is analyzed to determine if dATP/ATP was hydrolyzed. Suitable candidate 
pharmaceuticals are identified by the absence of hydrolysis. The tau complex and/or 
the beta subunit is derived from a Gram positive bacterium. 

Further methods for identifying chemicals that inhibit the activity of a 
DNA polymerase encoded by either the dnaE gene, polC gene, or their accessory 
proteins (i.e., clamp loader, clamp, etc.), are as follows: 

1) Contacting a primed DNA molecule with the encoded product 
of the dnaE gene or polC gene in the presence of the candidate pharmaceutical, and 
dNTPs (or modified dNTPs) to form a reaction mixture. The reaction mixture is 
subjected to conditions, which in the absence of the candidate pharmaceutical, affect 
nucleic acid polymerization and the presence or absence of the extension product in 
the reaction mixture is analyzed. The candidate pharmaceutical is detected by the 
absence of extension product. The protein encoded by the dnaE gene and PolC gene 
is derived from a Gram positive bacterium. 

2) Contacting a linear primed DNA molecule with a beta subunit 
and the encoded product of dnaE or PolC in the presence of the candidate 
pharmaceutical, and dNTPs (or modified dNTPs) to form a reaction mixture. The 
reaction mixture is subjected to conditions, which in the absence of the candidate 
pharmaceutical, affect nucleic acid polymerization, and the presence or absence of the 
extension product in the reaction mixture is analyzed. The candidate pharmaceutical 
is detected by the absence of extension product. The protein encoded by the dnaE 
gene and PolC gene is derived from a Gram positive bacterium. 

3) Contacting a circular primed DNA molecule (may be coated 
with SSB) with a tau complex, a beta subunit and the encoded product of a dnaE gene 
or Pole gene in the presence of the candidate pharmaceutical, and dNTPs (or 
modified dNTPs) to form a reaction mixture. The reaction mixture is subjected to 
conditions, which in the absence of the candidate pharmaceutical, affect nucleic acid 
polymerization, and the presence or absence of the extension product in the reaction 
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mixture is analyzed. The candidate phannaceutical is detected by the absence of 
product. The protein encoded by the dnaE gene and PolC gene, the beta subunit, 
and/or the tau complex are derived from a Gram positive bacterium. 

4) Contacting a beta subunit with the product encoded by a dnaE 
5 gene or PolC gene in the presence of the candidate pharmaceutical to form a reaction 

mixture. The reaction mixture is then analyzed for interaction between the beta 
subunit and the product encoded by the dnaE gene or PolC gene. The candidate 
pharmaceutical is detected by the absence of interaction between the beta subunit and 
the product encoded by the dnaE gene or PolC gene. The beta subunit and/or the 
10 protein encoded by the dnaE gene and PolC gene is derived from a Gram positive 
bacterium. 

5) The present invention discloses a method to identify chemicals 
that inhibit a DnaB helicase. The method includes contacting the DnaB helicase with 
a DNA molecule substrate that has a duplex region in the presence of a nucleoside or 

15 deoxynucleoside triphosphate energy source and a candidate phannaceutical to form a 
reaction mixture. The reaction mixture is subjected to conditions that support helicase 
activity in the absence of the candidate pharmaceutical. The DNA duplex molecule in 
the reaction mixture is analyzed for whether it is converted to ssDNA. The candidate 
pharmaceutical is detected by the absence of conversion of the duplex DNA molecule 

20 to the ssDNA molecule. The DnaB helicase is derived from a Gram positive 
bacterium. 

6) The present invention describes a method to identify chemicals 
that inhibit the nucleoside or deoxynucleoside triphosphatase activity of a DnaB 
helicase. The method includes contacting the DnaB helicase with a DNA molecule 

25 substrate that has a duplex region in the presence of a nucleoside or deoxynucleoside 
triphosphate energy source and the candidate pharmaceutical to form a reaction 
mixture. The reaction mixture is subjected to conditions that support nucleoside or 
deoxynucleoside triphosphatase activity of the DnaB helicase in the absence of the 
candidate pharmaceutical. The candidate pharmaceutical is detected by the absence of 

30 conversion of nucleoside or deoxynucleoside triphosphate to nucleoside or 

deoxynucleoside diphosphate. The DnaB helicase is derived from a Gram positive 
bacterium. 
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7) The present invention describes a method to identify chemicals 
that inhibit a primase. The method includes contacting primase with a ssDNA 
molecule in the presence of a candidate pharmaceutical to form a reaction mixture. 
The reaction mixture is subjected to conditions that support primase activity (e.g., the 
presence of nucleoside or deoxynucleoside triphosphates, appropriate buffer, presence 
or absence of DnaB helicase) in the absence of the candidate pharmaceutical. Suitable 
candidate pharmaceuticals are identified by the absence of primer formation detected 
either directly or indirectly. The primase is derived from a Gram positive bacterium. 

8) The present invention describes a method to identify chemicals 
that inhibit the ability of a primase and the protein encoded by a dnaB gene to interact. 
This method includes contacting the primase with the protein encoded by the dnaB 
gene in the presence of the candidate pharmaceutical to form a reaction mixture. The 
reaction mixture is subjected to conditions under which the primase and the protein 
encoded by the dnaB gene interact in the absence of the candidate pharmaceutical. 
The reaction mixture is then analyzed for interaction between the primase and the 
protein encoded by the dnaB gene. The candidate pharmaceutical is detected by the 
absence of interaction between the primase and the protein encoded by the dnaB gene. 
The primase and/or the dnaB gene are derived from a Gram positive bacterium. 

9) The present invention describes a method to identify chemicals 
that inhibit the ability of a protein encoded by a dnaB gene to interact with a DNA 
molecule. This method includes contacting the protein encoded by the dnaB gene 
with the DNA molecule in the presence of the candidate pharmaceutical to form a 
reaction mixture. The reaction mixture is subjected to conditions under which the 
DNA molecule and the protein encoded by the dnaB gene interact in the absence of 
the candidate pharmaceutical. The reaction mixture is then analyzed for interaction 
between the protein encoded by the dnaB gene and the DNA molecule. The candidate 
pharmaceutical is detected by the absence of interaction between the DNA molecule 
and the protein encoded by the dnaB gene. The dnaB gene is derived from a Gram 
positive bacterium. 
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EXAMPLES 

The following examples are provided to illustrate embodiments of the 
present invention, but they are by no means intended to limit its scope. 

Example 1 - Materials 

Labeled deoxy- and ribonucleoside triphosphates were from Dupont- 
New England Nuclear; unlabeled deoxy- and ribonucleoside triphosphates were from 
Pharmacia-LKB; E. coli replication proteins were purified as described, alpha, 
epsilon, gamma, and tau (Studwell et al., 'Trocessive Replication is Contingent on the 
Exonuclease Subunit of DNA Polymerase ffl Holoenzyme," J. Biol. Chem. . 265:1 171- 
1178(1 990), which is hereby incorporated by reference), beta (Kong et al ., "Three 
Dimensional Structure of the Beta Subunit oi Escherichia coli DNA Polymerase III 
Holoenzyme: A Sliding DNA Clamp," Cell, 69:425-437 (1992), which is hereby 
incorporated by reference), delta and delta prime (Dong et al., "DNA Polymerase m 
Accessory Proteins. I. HolA and holB Encoding 5 and 5'," J. Biol. Chem. . 268: 11 758- 
1 1765 (1993), which is hereby incorporated by reference), chi and psi (Xiao et al., 
"DNA Polymerase ffl Accessory Proteins, ffl. HolC and holD Encoding chi and psi," 
J. Biol. Chem., 268: 1 1 773-1 1 778 (1 993), which is hereby incorporated by reference), 
theta (Studwell-Vaughan et al., "DNA Polymerase ffl Accessory Proteins. V. Theta 
Encoded by holE," J. Biol. Chem. . 268:1 1785-1 1791 (1993). which is hereby 
incorporated by reference), and SSB (Weiner et al., "The Deoxyribonucleic Acid 
Unwinding Protein of Escherichia coli" J. Biol. Chem.. 250: 1 972- 1 980 ( 1 975), 
which is hereby incorporated by reference). E. coli Pol ffl core and clamp loader 
complex (composed of subunits gamma, delta, delta prime, chi, and psi) were 
reconstituted as described in Onrust et al., "Assembly of a Chromosomal Replication 
Machine: Two DNA Polymerases, a Clamp Loader and Sliding Clamps in One 
Holoenzyme Particle. I. Organization of the Clamp Loader," J. Biol. Chem .. 
270:13348-13357 (1995), which is hereby incorporated by reference. Pol ffl* was 
reconstituted and purified as described in Onrust et al., "Assembly of a Chromosomal 
Replication Machine: Two DNA Polymerases, a Clamp Loader and Sliding Clamps 
in One Holoenzyme Particle, ffl. Interface Between Two Polymerases and the Clamp 
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Loader," J. Biol. Chem., 270:13366-13377 (1995), which is hereby incorporated by 
reference. Protein concentrations were quantitated by the Protein Assay (Bio-Rad) 
method using bovine serum albumin (BSA) as a standard. DNA oligonucleotides 
were synthesized by Oligos etc. Calf thymus DNA was jfrom Sigma. Buffer A is 20 
mM Tris-HCl (pH=7.5), 0.5 mM EDTA, 2 mM DTT, and 20% glycerol Replication 
buffer is 20 mM Tris-Cl (pH 7.5), 8 mM MgC]2, 5 mM DTT, 0.5 mM EDTA, 40 
Mg/ml BSA, 4% glycerol, 0.5 mM ATP, 3 mM each dCTP, dGTP, dATP, and 20 ^M 
[a-^2p]dTTP. P-cell buffer is 50 mM potassium phosphate (pH 7.6), 5 mM DTT, 0.3 
mM EDTA, 20% glycerol. T.E. buffer is 10 mM Tris-HCl (pH 7.5), 1 mM EDTA. 
Cell lysis buffer is 50 mM Tris-HCl (pH 8.0) 10 % sucrose, 1 M NaCl, 0.3 mM 
spermidine. 

Example 2 - Calf Thymus DNA Replication Assays 

These assays were used in the purification of DNA polymerases from 
S. aureus cell extracts. Assays contained 2.5 ^g activated calf thymus DNA in a final 
volume of 25 ^1 rephcation buffer. An aliquot of the fraction to be assayed was added 
to the assay mixture on ice followed by incubation at 37**C for 5 min. DNA synthesis 
was quantitated using DE81 paper as described in Rowen et ah, "Primase, the DnaG 
Protein of Escherichia colL An Enzyme Which Starts DNA Chains," J. BioL Chem. . 
253:758-764 (1979), which is hereby incorporated by reference. 

Example 3 - Polyd A-oligodT Replication Assays 

PolydA-oligodT was prepared as follows. PolydA of average length 
4500 nucleotides was purchased from SuperTecs. 01igodT35 was synthesized by 
Oligos etc. 145 ul of 5.2 mM (as nucleotide) polydA and 22 ^1 of 1 .75 mM (as 
nucleotide) oligodT were mixed in a final volume of 2100 ^l T.E. buffer (ratio as 
nucleotide was 2 1 : 1 polydA to oligodT). The mixture was heated to boiling in a 1 ml 
eppendorf tube, then removed and allowed to cool to room temperature. Assays were 
performed in a final volume of 25 ^il 20 mM Tris-Cl (pH 7.5), 8 mM MgCb, 5 mM 
DTT, 0.5 mM EDTA, 40 ^ig/ml BSA, 4% glycerol, containing 20 ^M [a-32p]dTTP 
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and 0.36 ng poIydA-oligodT. Proteins were added to the reaction on ice, then shifted 
to 37»C for 5 min. DNA synthesis was quantitated using DE81 paper as described in 
Rowen et al., "Primase, the DnaG Protein of Escherichia coli. An Enzyme Which 
Starts DNA Chains," J. Biol. Chem. . 253:758-764 (1979), which is hereby 
incorporated by reference. 



Example 4 - Singly Primed M13mpl8 ssDNA Replication Assays 

M13mpl8 was phenol extracted from phage and purified by two 
successive bandings (one downward and one upward) in cesium chloride gradients. 
M13mpl8 ssDNA was singly primed with a DNA 30mer (map position 6817-6846) as 
described in Studwell et al. "Processive Replication is Contingent on the Exonuclease 
Subunit of DNA Polymerase ffl Holoenzyme," J. Biol. Chem. . 265: 1 171-11 78 (1990), 
which is hereby incorporated by reference. Replication assays contained 72 ng of 
singly primed M13mpl8 ssDNA in a final volume of 25 m1 of replication buffer. 
Other proteins added to the assay, and their amounts, are indicated in the Brief 
Description of the Drawings. Reactions were incubated for 5 min. at 37°C and then 
were quenched upon adding an equal volume of 1% SDS and 40 mM EDTA. DNA 
synthesis was quantitated using DE81 paper as described in Rowen et al., "Primase, 
the DnaG Protein of Escherichia coli. An Enzyme Which Starts DNA Chains," J, 
Biol. Chem. , 253:758-764 (1979), which is hereby incorporated by reference, and 
product analysis was performed in a 0.8% native agarose gel followed by 
autoradiography. 



Example 5 - Genomic Staphylococcus aureus DNA 



Two strains of S. aureus were used. For PCR of the first fragment of 
the dnaX gene sequence, the strain was ATCC 25923. For all other work the strain 
was strain 4220 (a gift of Dr. Pat Schlievert, University of Minnisota). This strain 
lacks a gene needed for producing toxic shock (Kreiswirth et al., "The Toxic Shock 
Syndrome Exotoxin Structural Gene is Not Detectably Transmitted by a Prophage," 
Nature. 305:709-712 (1996) and Balan et al., "Autocrine Regulation of Toxin 
Synthesis \)y Staphylococcus aureus," ^oc. Natl. Acad. Sci. USA 92:1619-1623 
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(1 995), which are hereby incorporated by reference). S. aureus cells were grown 
overnight at 37°C in LB containing 0.5% glucose. Cells were collected by 
centrifiigation (24 g wet weight). Cells were resuspended in 80 ml solution 1 (50 mM 
glucose, 10 mM EDTA, 25 mM Tris-HCL (pH 8.0)). SDS and NaOH were then 
added to 1% and 0.2 N, respectively, followed by incubation at 65^ for 30 min. to 
lyse the cells. 68.5 ml of 3 M sodium acetate (pH 5.0) was added followed by 
centrifugation at 12,000 rpm for 30 min. The supernatant was discarded and the pellet 
was washed twice with 50 ml of 6M urea, 10 mM Tris-HCL (pH 7.5), 1 mM EDTA 
using a dounce homogemzer. After each wash, the resuspended pellet was collected 
by centrifiigation (1 2,000 ipm for 20 min.). After the second wash, the pellet was 
resuspended in 50 ml 10 mM T.E. buffer using a dounce homogenizer and then 
incubated for 30 min. at 65"C. The solution was centrifiiged at 12,000 rpm for 20 
min., and the viscous supernatant was collected. 43.46 g CsC^ was added to the 50 
ml of supernatant (density between 1 .395-1 .398) and poured into two 35 ml quick seal 
ultracentrifiige tubes (tubes were completely filled using the same density of CsCl2 in 
T.E.). To each tube was added 0.5 ml of a 10 mg/ml stock of ethidium bromide. 
Tubes were spun at 55,000 ipm for 18 h at WC in a Sorvall TV860 rotor. The band 
of genomic DNA was extracted using a syringe and needle. Ethidium bromide was 
removed using two butanol extractions and then dialyzed against 4 1 of T.E. at pH 8.0 
overnight. The DNA was recovered by ethanol precipitation and then resuspended in 
T.E. buffer (1 .7 mg total) and stored at -20°C. 

Example 6 - Cloning and Purification ofS. aureus Pol III-L 

To fiirther characterize the mechanism of DNA replication in S. 
aureus, large amounts of its replication proteins were produced through use of the 
genes. The polC gene encoding S. aureus Pol lO-L (alpha-large) subunit has been 
sequenced and expressed in E. coli (Pacitti et al., "Characterization and 
Overexpression of the Gene Encoding Staphylococcus aureus DNA Polymerase m," 
Gene, 1 65:5 1-56 (1 995), which is hereby incorporated by reference). The previous 
work utiUzed a pBS[KS] vector for expression in which the E. coli RNA polymerase 
is used for gene transcription. In the earlier study, the S. aureus polC gene was 
precisely cloned at the 5' end encoding the N-tetminus, but the amount of the gene 
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that remained past the 3' end was not disclosed and the procedure for subcloning the 
gene into the expression vector was only briefly summarized. Furthermore, the 
previous study does not show the level of expression of the 5. aureus Pol DI-L, nor the 
amount of S. aureus Pol III-L that is obtained from the induced cells. Since the 
5 previously published procedure could not be repeated and the efficiency of the 

expression vector could not be assessed, another strategy outlined below had to be 
developed. 

The isolated polC gene was cloned into a vector that utilizes T7 RNA 
polymerase for transcription as this process generally expresses a large amount of 

10 protein. Hence, the S. aureus polC gene was cloned precisely into the start codon at 

the Ndel site downstream of the T7 promotor in a pET vector . As the polC gene 
contains an internal Ndel site, the entire gene could not be amplified and placed it into 
the Ndel site of a pET vector. Hence, a three step cloning strategy that yielded the 
desired clone was devised (Figure 1). These attempts were quite frustrating initially 

15 as no products of cloning in standard E. coli strains such as DH5a, a typical 

laboratory strain for preparation of DNA, could be obtained. Finally, a cell that was 
mutated in several genes affecting DNA stability was useful in obtaining the desired 
products of cloning. 

In brief, the cloning strategy required use of another expression vector 

20 (called pET1137kDa) in which the 37 kDa subunit of human RFC, the clamp loader 
of the human replication system, had been cloned into the pETl 1 vector. The gene 
encoding the 37kDa subunit contains an internal Nsil site, which was needed for the 
precise cloning of the isolated polC gene. This three step strategy is shown in 
Figure 1. In the first step, an approximately 2.3 kb section of the 5* section of the gene 

25 (encoding the N-terminus of Pol lU-L) was amplified using the polymerase chain 
reaction (PGR). Primers were as follows: 

Upstream (SEQ. ID. No. 35) 

ggtggtaatt gtcttg cata tgacagagc 29 



Downstream (SEQ. ID. No. 36) 
agcgattaag tggattgccg ggttgtgatg c 



31 
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Amplification was performed using 500 ng genomic DNA, 0.5 mM EDTA, 1 nM of 
each primer, ImM MgS04, 2 units vent DNA polymerase (New England Biolabs) in 
100 Ml of vent buffer (New England Biolabs). Forty cycles were performed using the 
following cycling scheme: 94X, 1 min; 60°C, 1 min.; 72°C. 2.5 min. The product 
was digested with Ndel (underlined in the upstream primer) and Nsil (an internal site 
in the product) and the approximately 1.8 kb fragment was gel purified. A pETl 1 
vector containing as an insert the 37 kDa subunit of human rephcation factor C 
(pETl 1 37kDa) was digested with Ndel and Nsil and gel purified. The PCR fi-agment 
was ligated into the digested pETl 137kDa vector and the ligation reaction was 
transformed into Epicurean coll supercompetent SURE 2 cells (Stratagene) and 
colonies were screened for the correct chimera (pETl IPolCl) by examining 
minipreps for proper length and correct digestion products using Ndel and Nsil. In the 
second step, an approximately 2076 bp fi-agment containing the DNA encoding the C- 
terminus of Pol IH-L subunit was amplified using the following sequences as primers: 



Upstream (SEQ. ID. No. 37) 
agcatcacaa cccggcaatc cacttaatcg c 

Dovynstream (SEQ. ID. No. 38) 
gactacgcca tgggcattaa ataaatacc 



31 



29 



The amplification cycling scheme was as described above except the elongation step 
at 72°C was for 2 min. The product was digested with BamHI (underlined in the 
downstream primer) and Nsil (internal to the product) and the approximately 480 bp 
product was gel purified and ligated into the pETl IPolCl that had been digested with 
Nsil/BamHI and gel purified (ligated product is pETl lPoIC2). To complete the 
expression vector, an approximately 2080 bp PCR product wa^ amplified over the two 
Nsil sites internal to the gene using the following primers: 



Upstream (SEQ. ID. No. 39) 
gaag atgcat ataaacgtgc aagacctagt 

Downstream (SEQ. ID. No. 40) 

gtctgacgca cgaattgtaa agtaa gatgc atag 
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The amplification cycling scheme was as described above except the 72^C elongation 
step was 2 min. The PGR product, and the pETl lPolC2 vector, were digested with 
Nsil and ge! purified. The ligation mixture was transformed as described above and 
colonies were screened for the correct chimera (pETl IPolC). 

To express Pol IQ-L polymerase, the pETl IPolC plasmid was 
transformed into E. coli strain BL21(DE3). 24 L off. coli BL21(DE3)pETl IPolC 
were grown in LB media containing 50 pg/ml ampicillin at 37X to an OD of 0.7 and 
then the temperature was lowered to 15°C. Cells were then induced for Pol IH-L 
expression upon addition of 1 mM IPTG to produce the T7 RNA polymerase needed 
to transcribe /7o/C This step was followed by further incubation at 15°C for 18 h. 
Expression of 5. aureus Pol m-L polymerase was so high that it could easily be 
visualized by Coomassie staining of a SDS polyacrylamide gel of whole cells 
(Figure 2A). The expressed protein migrated in the SDS polyacrylamide gel in a 
position expected for a 165 kDa polypeptide. In this procedure, it is important that 
cells are induced at 15°C, as induction at 37°C produces a truncated version of Pol IH- 
L polymerase, of approximately 130 kDa. 

Cells were collected by centrifugation at 5X. Cells (12 g wet weight) 
were stored at -70°C. The following steps were performed at 4^C. Cells were thawed 
and lysed in cell lysis buffer as described (final volume = 50 ml) and were passed 
through a French Press (Amico) at a minimum of 20,000 psi. PMSF (2 mM) was 
added to the lysate as the lysate was collected fi-om the French Press. DNA was 
removed and the lysate was clarified by centrifugation. The supematent was dialyzed 
for 1 h against Buffer A containing 50 mM NaCL The final conductivity was 
equivalent to 190 mM NaCL Supematent (24 ml, 208 mg) was diluted to 50 ml using 
Buffer A to bring the conductivity to 96 mM MgCb, and then was loaded onto an 8 
ml MonoQ column equilibrated in Buffer A containing 50 mM NaCL The column 
was eluted with a 160 ml linear gradient of Buffer A firom 50 mM NaCl to 500 mM 
NaCL Seventy five fi-actions (1.3 ml each) were collected (Figure 2B). Aliquots were 
analyzed for their ability to synthesize DNA, and 20 \x\ of each fi-action was analyzed 
by Coomassie staining of an SDS polyacrylamide gel. Based on the DNA synthetic 
capability, and the correct size band in the gel, fi-actions 56-65 containing Pol lU-L 
polymerase were pooled (22 ml, 31 mg). The pooled fi-actions were dialyzed 
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overnight at 4°C against 50 mM phosphate (pH 7.6), 5 mM DTT, O.I mM EDTA, 2 
mM PMSF, and 20 % glycerol (P-cell buffer). The dialyzed pool was loaded onto a 
4.5 ml phosphocellulose column equilibrated in P-cell buffer, and then eluted with a 
25 ml linear gradient of P-cell buffer from 0 M NaCl to 0.5 M NaCl. Fractions of 1 
ml were collected and analyzed in a SDS polyacrylamide gel stained with Coomassie 
Blue (Figure 2C). Fractions 20-36 contained the majority of the Pol Hl-large at a 
purity of greater than 90 % (5 mg). 



Example 7 - S. aureus Pol III-L is Not Processive on its Own 



The Pol m-L polymerase purifies from B. subtilis as a single subunit 
without accessory factors (Barnes et al., "Purification of DNA Polymerase HI of 
Gram-positive Bacteria." Methods in Enzv. . 262:35-42 (1995), which is hereby 
incorporated by reference). Hence, it seemed possible that it may be a Type I 
replicase (e.g., like T5 polymerase) and, thus, be capable of extending a single primer 
full length around a long singly primed template. To perform this experiment, a 
template M13mpl8 ssDNA primed with a single DNA oligonucleotide either in the 
presence or absence of SSB was used. DNA products were analyzed in a neutral 
agarose gel which resolved products by size. The results showed that Pol HI-L 
polymerase was incapable of extending the primer around the DNA (to form a 
completed duplex circle referred to as replicative form II ("RFII")) whether SSB was 
present or not. This experiment has been repeated using more enzyme and longer 
times, but no full length RFD products are produced. Hence, Pol HI-L would appear 
not to follow the paradigm of the T5 system (Type I replicase) in which the 
polymerase is efficient in synthesis in the absence of any other protein(s). 

Example 8 - Cloning and Purification of 5. aureus Beta Subunit 

The sequence of an S. aureus homolog of the E. coli dnaN gene 
(encoding the beta subunit) was obtained in a study in which the large recF region of 
DNA was sequenced (Alonso et al., "Nucleotide Sequence of the recF Gene Cluster 
From Staphylococcus aureus and Complementation Analysis in Bacillus subtilis recF 
Mutants," Mol. Gen. Genet., 246:680-686 (1 995), Alonso et al., 'TSTucleotide 
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Sequence of the recF Gene Cluster From Staphylococcus aureus and 
Complementation Analysis in Bacillus subtilis recF Mutants," Mol. Gen. Genet , 
248:635-636 (1 995), which are hereby incorporated by reference). Sequence 
alignment of the S. aureus beta and E. coli beta show approximately 30% identity. 
Overall this level of homology is low and makes it uncertain that S. aureus beta will 
have the same shape and function as the E. coli beta subunit. 

To obtain S. aureus beta protein, the dnaN gene was isolated and 
precisely cloned into a pET vector for expression in E coli. S. aureus genomic DNA 
was used as template to amplify the homolog of the dnaN gene (encoding the putative 
beta). The upstream and downstream primers were designed to isolate the dnaN gene 
by PCR amplification from genomic DNA. Primers were: 

Upstream (SEQ. E). No. 41) 

cgactggaag gagttttaac atatg atgga attcac 

Downstream (SEQ. ID. No. 42) 

ttatat ggat ccttagtaag ttctgattgg 3q 

The Ndel site used for cloning into pET16b (Novagen) is underlined in the Upstream 
primer and the BamHI site used for cloning into pET16b is underlined in the 
Downstream primer. The Ndel and BamHI sites were used for directional cloning 
into pET16 (Figure 3). Amplification was performed using 500 ng genomic DNA. 0.5 
raM dNTPs, 1 pM of each primer, ImM MgS04, 2 units vent DNA polymerase in 100 
ul of vent buffer. Forty cycles were performed using the following cycling scheme: 
94«'C, 1 min; 60°C, 1 min.; 720C, 1 min. 10s. The 1 167 bp product was digested with 
Ndel and BamHI and purified in a 0.7 % agarose gel. The pure digested fragment was 
ligated into the pET16b vector which had been digested with Ndel and BamHI and gel 
purified in a 0.7% agarose gel. Ligated products were transformed into R coli 
competent SURE H cells (Stratagene) and colonies were screened for the correct 
chimera by examining minipreps for proper length and correct digestion products 
using Ndel and BamHI. 

24 L of of BL21(DE3)pETbeta cells were grown in LB containing 50 
Hg/ml ampicillin at 37°C to an O.D. of 0.7, and. then, the temperature was lowered to 
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1 S^'C. IPTG was added to a concentration of 2 mM and after a further 1 8 h at 1 5°C to 
induce expression ofS. aureus beta (Figure 4A). It is interesting to note that the beta 
subunit, when induced at 37°C, was completely insoluble. However, induction of 
cells at 15°C provided strong expression of beta and, upon cell lysis, over 50% of the 
beta was present in the soluble fraction. 

Cells were harvested by centrifugation (44 g wet weight) and stored at - 
70°C. The following steps were performed at 4X. Cells (44 g wet weight) were 
thawed and resuspended in 45 ml IX binding buffer (5 mM imidizole, 0.5 M NaCl, 20 
niM Tris HCl (final pH 7.5)) using a dounce homogenizer. Cells were lysed using a 
French Pressure cell (Aminco) at 20,000 psi, and then 4.5 ml of 10 % polyamine P 
(Sigma) was added. Cell debris and DNA was removed by centrifugation at 13,000 
rpm for 30 min. at 4''C. The pET16beta vector places a 20 residue leader containing 
10 histidine residues at the N-terminus of beta. Hence, upon lysing the cells, the 
S. aureus beta was greatly purified by chromatography on a nickel chelate resin 
(Figure 4B). The supernatant (890 mg protein) was applied to a 10 ml HiTrap 
Chelating Separose column (Pharmacia-LKB) equilibrated in binding buffer. The 
column was washed with binding buffer, then eluted with a 100 ml linear gradient of 
60 mM imidazole to 1 M imidazole in binding buffer. Fractions of 1 .35 ml were 
collected. Fractions were analyzed for the presence of beta in an SDS polyacrylamide 
gel stained with Coomassie Blue. Fractions 28-52, containing most of the beta 
subunit, were pooled (35 ml, 82 mg). Remaining contaminating protein was removed 
by chromatography on MonoQ. The S, aureus beta becomes insoluble as the ionic 
strength is lowered and, thus, the pool of beta was dialyzed overnight against Buffer A 
containing 400 mM NaCl. The dialyzed pool became slightly turbid indicating it was 
at its solubility limit at these concentrations of protein and NaCL The insoluble 
material was removed by centrifugation (64 mg remaining) and, then, diluted 2-fold 
with Buffer A to bring the conductivity to 256. The protein was then applied to an 8 
ml MonoQ column equilibrated in Buffer A plus 250 mM NaCl and then eluted with a 
100 ml linear gradient of Buffer A from 0.25M NaCl to 0.75 M NaCl; fractions of 
1 .25 ml were collected (Figure 4C). . Under these conditions, approximately 27 mg of 
the beta flowed through the column and the remainder eluted in fractions 1-18 
(24 mg). 
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Example 9 - The aureus Beta Subunit Protein Stimulates 5. aureus Pol III-L 
and E. coli Core 

The experiment of Figure 5 A, tests the ability of S. aureus beta to 
5 stimulate S. aureus Pol III-L on a linear polydA-oligodT template. Reactions are also 

performed with E, coli beta and Pol EI core. The linear template was polydA of 
average length of 4500 nucleotides primed with a 30mer oligonucleotide of T 
residues. The first two lanes show the activity of Pol ni-L either without (lane 1) or 
with S, aureus beta (lane 2). The result shows that the S. aureus beta stimulates Pol 

10 m-L approximately 5-6 fold. Lanes 5 and 6 show the corresponding experiment using 

E. coli core with (lane 6) or without (lane 5) E. coli beta. The core is stimulated over 
10-fold by the E, coli beta subunit under the conditions used. 

Although Gram positive and Gram negative cells diverged fi-om one 
another long ago and components of one polymerase machinery would not be 

15 expected to be interchangable, it was decided to test the activity of the S. aureus beta 
with E, coli Pol in core. Lanes 3 and 4 shows that the S, aureus beta also stimulates 
£. coli core about 5-fold. This result can be explained by an interaction between the 
clamp and the polymerase that has been conserved during the evolutionary divergence 
of gram positive and gram negative cells. A chemical inhibitor that would disrupt this 

20 interaction would be predicted to have a broad spectrum of antibiotic activity, shutting 

down replication in Gram negative and Gram positive cells alike. This assay, and 
others based on this interaction, can be devised to screen chemicals for such 
inhibition. Further, since all the proteins in this assay are highly overexpressed 
through recombinant techniques, sufficient quantities of the protein reagents can be 

25 obtained for screening hundreds of thousands of compounds. 

In summary, the results show that S. aureus beta, produced in E. coli, is 
indeed an active protein (i.e., it stimulates polymerase activity). Furthermore, the 
results shows that Pol ni-L functions with a second protein (i.e., S. aureus beta). 
Before this experiment, there was no assurance that Pol ni-L, which is significantly 

30 different in structure firom E, coli alpha, would function with another protein. For 
example, unlike E, coli alpha, which copurifies with several accessory proteins, Pol 
ni-L purified fi"om B. subtilis as a single protein with no other subunits attached 
(Barnes et al., "Purification of DNA Polymerase HI of Gram-positive Bacteria,'* 
Methods in Enzv. , 262:35-42 (1995), which is hereby incorporated by reference). 
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Finally, if one were to assume that S. aureus beta would function with a polymerase, 
the logical candidate would have been the product of the dnaE gene (alpha-small) 
instead of polC (Pol IH-L) since the dnaE product is more homologous to E. coli alpha 
subunit than Pol III-L. 

5 

Example 10 - The S. aureus Beta Subunit Behaves as a Circular Sliding Clamp 

The ability of S. aureus beta to stimulate Pol HI-L could be explained 
by formation of a 2-protein complex between Pol HI-L and beta to form a processive 

10 replicase similar to the Type U class (e.g., T7 type). Alternatively, the S. aureus 

replicase is organized as the Type DI rephcase which operates with a circular sliding 
clamp and a clamp loader. In this case, the S, aureus beta would be a circular protein 
and would require a clamp loading apparatus to load it onto DNA. The ability of the 
beta subunit to stimulate Pol ni-L in Figure 5A could be explained by the fact that the 

15 polydA-oligodT template is a linear DNA and a circular protein could thread itself 

onto the DNA over an end. Such "end threading" has been observed with PCNA and 
explains its ability to stimulate DNA polymerase delta in the absence of the RFC 
clamp loader (Burgers et al., "ATP-Independent Loading of the Proliferating Cell 
Nuclear Antigen Requires DNA Ends," J, BioL Chem. , 268:19923-19926 (1993), 

20 which is hereby incorporated by reference). 

To distinguish between these possibilities, S. aureus beta was 
examined for ability to stimulate Pol UI-L on a circular primed template. In 
Figure 5B, assays were performed using circular M13mpl8 ssDNA coated with 
E. coli SSB and primed with a single oligonucleotide to test the activity of beta on 

25 circular DNA. Lane 1 shows the extent of DNA synthesis using Pol III-L alone. In 
lane 2, Pol EI-L was supplemented with S. aureus beta. The S, aureus beta did not 
stimulate the activity of Pol ni-L on this circular DNA (nor in the absence of SSB). 
Inability of aureus beta to stimulate Pol EI-L is supported by the results of Figure 6, 
lane 1 that analyzes the product of Pol III-L action on the circular DNA in an agarose 

30 gel in the presence of 5. aureus beta. In summary, these results show that S. aureus 

beta only stimulates Pol III-L on linear DNA, not circular DNA. Hence, the 5. aureus 
beta subunit behaves as a circular protein. 
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Lane 3 shows the result of adding both iS". aureus beta and E. coli 
gamma complex to Pol III-L. Again, no stimulation was observed (compare with lane 
1). This result indicates that the functional contacts between the clamp and clamp 
loader were not conserved during evolution of Gram positive and Gram negative cells. 
5 Controls for these reactions on circular DNA are shown for the E. coli 

system in Lanes 4-6. Addition of only beta to E, coli Pol ID core did not result in 
stimulating the polymerase (compare lanes 4 and 5). However, when clamp loader 
complex was included with beta and core, a large stimulation of synthesis was 
observed (lane 6). Li summary, stimulation of synthesis is only observed when both 
10 beta and clamp loader complex were present, consistent with inability of the circular 
beta ring to assemble onto circular DNA by itself. 

Example 11 - Pol III-L Functions as a Pol Ill-Type Replicase with Beta and a 
Clamp Loader Complex to Become Processive 

15 

Next, it was determined whether S, aureus Pol III-L requires two 
components (a beta clamp and a clamp loader) to extend a primer full length around a 
circular primed template. In Figure 6, a template circular Ml 3mpl 8 ssDNA primed 
with a single DNA oligonucleotide was used. DNA products were analyzed in a 

20 neutral agarose gel which resolves starting materials (labeled ssDNA in Figure 6) 
from completed duplex circles (labelled RFII for replicative form II). The first two 
lanes show, as demonstrated in other examples, that Pol IH-L is incapable of 
extending the primer around the circular DNA in the presence of only S. aureus beta. 
In lane 4 of Figure 6, E. coli clamp loader complex (also known as gamma complex) 

25 and beta subunit were mixed with S. aureus Pol III-L in the assay containing singly 

primed M13mpl 8 ssDNA coated with SSB. If the beta clamp, assembled on DNA by 
clamp loader complex, provides processivity to S. aureus Pol m-L, the ssDNA circle 
should be converted into a fully duplex circle (RFII) which would be visible in an 
agarose gel analysis. The results of the experiment showed that the E. coli beta and 

30 clamp loader complex did indeed provide Pol III-L with ability to fully extend the 

primer around the circular DNA to form the RFII (lane 4). The negative control using 
only coli clamp loader complex and beta is shown in lane 3. For comparison, lane 
6 shows the result of mixing the three components of the £. coli system (Pol HI core, 
beta, and clamp loader complex). This reaction gives almost exclusively full length 
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RFD product. The qualitatively different product profile that Pol EQ-L gives in the 
agarose gel analysis compared to E. coli Pol EQ core with beta and clamp loader 
complex shows that the products observed using Pol III-L is not due to a contaminant 
of E, coli Pol ni core in the S. aureus Pol III-L preparation (compare lanes 4 and 6). 
5 It is generally thought that the polymerase of one system is specific for 

its SSB. However, these reactions are performed on ssDNA coated with the E. coli 
SSB protein. Hence, the S. aureus Pol ni-L appears capable of utilizing E, coli SSB 
and the E. coli beta. It would appear that the only component that is not 
interchangeable between the Gram positive and Gram negative systems is the clamp 
10 loader complex. 

Thus, the S. aureus Pol HI-L functions as a Pol EH type replicase with 
the E, coli beta clamp assembled onto DNA by a clamp loader complex. 

Example 12 - Purification of Two DNA Polymerase Ill-Type Enzymes From 
15 S. aureus Cells 

The MonoQ resin by Pharmacia has very high resolution which would 
resolve the three DNA polymerases of S, aureus. Hence, S. aureus cells were lysed, 
DNA was removed from the lysate, and the clarified lysate was applied onto a MonoQ 

20 colunm. The details of this procedure are: 300 L of S. aureus (strain 4220, a gift of 
Dr. Pat Schlievert, University of Minnisota) was grown in 2X LB media at 37°C to an 
O.D. of approximately 1 .5 and then were collected by centrifugation. Approximately 
2 kg of wet cell paste was obtained and stored at -TO'^C. 122 g of cell paste was 
thawed and resuspended in 192 ml of cell lysis buffer followed by passage through a 

25 French Press cell (Aminco) at 40,000 psi. The resuUant lysate was clarified by high 
speed centrifugation (1.3 g protein in 120 ml). A 20 ml aliquot of the supernatant was 
dialyzed 2 h against 2 L of buffer A containing 50 mM NaCl. The dialyzed material 
(148 mg, conductivity =101 mM NaCl) was diluted 2-fold with Buffer A containing 
50 mM NaCl and then loaded onto an 8 ml MonoQ column equilibrated in Buffer A 

30 containing 50 mM NaCl. The column was washed with Buffer A containing 50 mM 
NaCl, and then eluted with a 160 ml linear gradient of 0.05 M NaCl to 0.5 M NaCl in 
Buffer A. Fractions of 2.5 ml (64 total) were collected, followed by analysis in an 
SDS polyacrylamide gel for their replication activity in assays using calf thymus 
DNA. 
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Three peaks of DNA polymerase activity were identified (Figure 7). 
Previous studies of cell extracts prepared from the Gram positive organism Bacillus 
subtilis identified only two peaks of activity off a DEAE column (similar charged 
resin to MonoQ). The first peak was Pol II, and the second peak was a combination of 
DNA polymerases I and m. The DNA polymerases I and HI were then separated on a 
subsequent phosphocellulose column. The middle peak in Figure 7 is much larger 
than the other two peaks and, thus, it was decided to chromatograph this peak on a 
phosphocellulose column. The second peak of DNA synthetic activity was pooled 
(fractions 37-43; 28 mg in 14 ml) and dialyzed against 1.5 L P-cell buffer for 2.5 h. 
Then, the sample (ionic strength equal to 99 mM NaCl) was applied to a 5 ml 
phosphocellulose column equilibrated in P-cell buffer. After washing the column in 
10 ml P-cell buffer, the column was eluted with a 60 ml gradient of 0 - 0.5 M NaCl in 
P-cell buffer. Seventy fractions were collected and then analyzed for DNA synthesis 
using calf thymus DNA as template. This column resolved the polymerase activity 
into two distinct peaks (Figure 7B). 

Hence, there appear to be four DNA polymerases in Staphylococcus 
aureus. They were designated here as peak 1 (first peak ofT MonoQ), peak 2 (first 
peak off phosphocellulose), peak 3 (second peak of phosphocellulose), and peak 4 
(last peak off Mono Q) (see Figure 7). Peak 4 was presumably Pol HI-L, as it elutes 
from MonoQ in a similar position as the Pol m-L expressed in E, coli (compare 
Figure 7A with Figure 2). 

Example 13 - Demonstration That Peak 1 (Pol 111-2) Functions as a Pol Ill-Type 
Replicase With E, coli Beta Assembled on DNA by E, coli Clamp 
Loader Complex. 

To test which peak contained a Pol El-type of polymerase, an assay 
was used in which the E. coli clamp loader complex and beta support formation of full 
length RFn product starting from E, coli SSB coated circular Ml 3mpl 8 ssDNA 
primed with a single oligonucleotide. In Figure 8, both Peaks 1 and 2 are stimulated 
by the £. coli clamp loader complex and beta subunit and, in fact. Peaks 2 and 3 are 
inhibited by these proteins (the quantitation is shovra below the gel in the figure). 
Further, the product analysis in the agarose gel shows full length RFII duplex DNA 
circles only for peaks 1 and 4. These results, combined with the NEM, pCMB, and 
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KCl characteristics in Tables 2 and 3 below, suggest that there are two Pol Hl-type 
DNA polymerases in S, aureus and that these are partially purified in peaks 1 and 4, 

Next, it was determined which of these peaks of DNA polymerase 
activity correspond to DNA polymerases I, n, and ID, and which peak is the 
unidentified DNA polymerase. In the Gram postive bacterium B. subtilis, Pol m is 
inhibited by pCMB, NEM, and 0. 1 5 M NaCl, Pol n is inhibited by KCl, but not NEM 
or 0.15 M KCL, and Pol 1 is not inhibited by any of these treatments (Gass et al., 
"Further Genetic and Enzymological Characterization of the Three Bacillus subtilis 
Deoxyribonucleic Acid Polymerases," J. Biol. Chem. . 248:7688-7700 (1973), which 
is hereby incorporated by reference). Hence, assays were performed in the presence or 
absence of pCMB, NEM, and 0.15 M KCl (see Tables 2 and 3 below). Peak 3 clearly 
corresponded to Pol I, because it was not inhibited by NEM, pCMB, or 0. 1 5 M NaCl. 
Peak 2 correspond to Pol n, because it was not inhibited by NEM, but was inhibited 
by pCMB and 0, 1 5 M NaCl. Peaks 1 and 4 both had characteristics that mimic Pol 
m; however, peak 4 elutes on MonoQ at a similar position as Pol EI-L expressed in £. 
coli (see Figure 2B). Hence, peak 4 is likely Pol EI-L, and peak 1 is likely the 
unknown polymerase. 



Table 2: Expected Characteristics of Polymer ases 
Polymerase pCMB NEM 

I not inhibited* not inhibited 

° inhibited** not inhibited 

^oXm-h inhibited inhibited 



0.15M KCl 
not inhibited 
not inhibited 
not inhibited 



Not inhibited is defined as greater than 75% remaining activity 
Inhibited is defined as less than 40% remaining activity 



Table 3: Observed Characteristics 



Peak 
Peak I 
Peak2 
Peak3 
Peak4 



pCMB 
inhibited 
inhibited 
not inhibited 
inhibited 



NEM 
inhibited 
not inhibited 
not inhibited 
inhibited 



0.1 5M KCL assienment 
new polymerase 
Poin 
Poll 

Pol m-L 
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KCl characteristics in Tables 2 and 3 below, suggest that there are two Pol Ill-type 
DNA polymerases in S. aureus and that these are partially purified in peaks 1 and 4. 



activity correspond to DNA polymerases I, H, and ID, and which peak is the 
unidentified DNA polymerase. In the Gram postive bacterium 5. subtilis, Poi m is 
inhibited by pCMB, NEM, and 0.15 M NaCl, Pol D is inhibited by KCl, but not NEM 
or 0.15 M KCL, and Pol I is not inhibited by any of these treatments (Gass et al., 
"Further Genetic and Enzymological Characterization of the Three Bacillus subtilis 
Deoxyribonucleic Acid Polymerases " J. BioL Chem. , 248:7688-7700 (1973), which 
is hereby incorporated by reference). Hence, assays were performed in the presence or 
absence of pCMB, NEM, and 0. 1 5 M KCl (see Tables 2 and 3 below). Peak 3 clearly 
corresponded to Pol I, because it was not inhibited by NEM, pCMB, or 0.15 M NaCl. 
Peak 2 correspond to Pol H, because it was not inhibited by NEM, but was inhibited 
by pCMB and 0.15 M NaCl. Peaks 1 and 4 both had characteristics that mimic Pol 
ni; however, peak 4 elutes on MonoQ at a similar position as Pol HI-L expressed in £ 
coli (see Figure 2B). Hence, peak 4 is likely Pol HI-L, and peak 1 is likely the 
unknown polymerase. 

Table 2: Expected Characteristics of Polymera ses 

Polymerase pCMB NEM 0.15M KCl 



Next, it was determined which of these peaks of DNA polymerase 



Poll 



not inhibited* not inhibited 
inhibited* * not inhibited 

inhibited inhibited 



not inhibited 



Poin 
Pol m-L 



not inhibited 
not inhibited 



Not inhibited is defined as greater than 75% remaining activity 
Inhibited is defined as less than 40% remaining activity 



Table 3: Observed Characteristics 




NEM 



0.1 5M KCL assignment 
new polymerase 



Peak! inhibited 
Feak2 inhibited 



inhibited 
not inhibited 
not inhibited 
inhibited 



Poin 
Poll 

Pol m-L 



Peaks not inhibited 

Peak4 inhibited 
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Example 14 - Identification and Cloning of 5. aureus dnaE 

This invention describes the finding of two DNA polymerases that 
function with a sHding clamp assembled onto DNA by a clamp loader. One of these 
5 DNA polymerases is likely Pol III-L, but the other has not been identified previously. 
Presumably, the chromatographic resins used in earlier studies did not have the 
resolving power to separate the enzyme jfrom other polymerases. This would be 
compounded by the low activity of Pol in-2. To identify a gene encoding the second 
Pol in, the amino acid sequences of the Pol IH alpha subunit of Escherichia coli, 
10 Salmonella typhimurium. Vibrio cholerae, Haemophilis influenzae^ and Helicobacter 

pylori were aligned using Clustal W (1.5). Two regions about 400 residues apart were 
conserved and primers were designed for the following amino acid sequences: 

Upstream, corresponding in E. coli to residues 385-399 (SEQ. ID. No. 43) 

15 Leu Leu Phe Glu Arg Phe Leu Asn Pro Glu Arg Val Ser Met Pro 

15 10 15 

Downstream, corresponding in £. coli to residues 750-764 (SEQ. ID. No. 44) 

Lys Phe Ala Gly Tyr Gly Phe Asn Lys Ser His Ser Ala Ala Tyr 
20 1 5 10 15 

The following primers were designed to these two peptide regions using codon 
preferences for 5*. aureus: 

25 Upstream (SEQ. ID. No. 45) 

cttctttttg aaagatttct aaataaagaa cgttattcaa tgcc 44 

Downstream (SEQ. ID. No. 46) 

ataagctgca gcatgacttt tattaaaacc ataacctgca aattt 45 

30 

Amplification was performed using 2.5 units of Tag DNA Polymerase (Gibco, BRL), 
100 ng S, aureus genomic DNA, 1 mM of each of the four dNTPs, 1 of each 
primer, and 3 mM MgCk in 100 ^1 of buffer. Thirty-five cycles of the following 
scheme were repeated: 94*^0, 1 min; 55°C, 1 min; 72°C, 90 sec. The PGR product 
35 (approximately 1 .1 kb) was electrophoresed in a 0.8 % agarose gel and purified using 
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a Geneclean III kit (Bio 101). The product was then divided equally into ten separate 
aliquots and used as a template for PGR reactions, according to the above protocol, to 
reamplify the fragment for sequencing. The final PGR product was purified using a 
Quiagen Quiaquick PGR Purification kit, quantitated via optical density at 260 nM, 
and sequenced by the Protein/DNA Technology Center at Rockefeller University. The 
same primers used for PGR were used to prime the sequencing reactions. 

Next, the following additional PGR primers were designed to obtain 
more sequence information 3' to the first amplified section. 

Upstream (SEQ. DD. No. 47) 

agttaaaaat gccatatttt gacgtgtttt agttctaat 39 

Downstream (SEQ. ID. No. 48) 

cttgcaaaag cggttgctaa agatgttgga cgaattatgg gg 42 

These primers were used in a PGR reaction using 2.5 units of Tag DNA Polymerase 
(Gibco, BRL) with 100 ng X aureus genomic DNA as a template, ImM dNTP's, 
1 nM of each primer, and 3 mM MgGh in 100 1 of Taq buffer. Thirty-five cycles of 
the following scheme were repeated: 94°C, 1 min; 55°G, 1 min; 72°G, 2 min 30 
seconds. The 1.6 Kb product was then divided into 5 aliquots, and used as a template 
in a set of 5 PGR reactions, as described above, to amplify the product for sequencing. 
The products of these reactions were purified using a Qiagen Qiaquick PGR 
Purification kit, quantitated via optical density at 260 nm, and sequenced by the 
Protein/DNA Technology Center at Rockefeller University. The sequence of this 
product yielded about 740 bp of new sequence 3' of the first sequence. 

As this gene shows better homology to the Gram negative Pol III a 
subunit compared to Gram positive Pol ID-L, it will be designated the dnaE gene. 

Example 15 - Identification and Cloning of 5. aureus dnaX 

The fact that the S. aureus beta stimulates Pol III-L and has a ring 
shape suggests that the Gram postive replication machinery is of the three component 
type. This implies the presence of a clamp loader complex. This is not a simple 
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determination to make as the fl. subtilis genome shows homologs to only two of the 
five subunits of the E. coli clamp loader (rfwoA^ encoding gamma, and holB encoding 
delta prime). On the basis of the experiments in this appHcation, which suggests that 
there is a clamp loader, it was believed that these two subunit homologues are part of 
the clamp loader for the S. aureus beta. 

As a start in obtaining the clamp loading apparatus, a strategy was 
devised to obtain the gene encoding the tau subunit ofS. aureus. In E, coli, the tau 
and gamma subunits are derived from the same gene. Tau is the full length product, 
and gamma is about 2/3 the length of tau. Gamma is derived from the <fwaZgene by 
what was originally believed to be an efficient translational frameshift mechanism 
that, after it occurs, incorporates only one unique C-terminal residue before 
encountering a stop codon. To identify the dnaXgm^ ofS, aureus by PCR analysis, 
the rfwoA^ genes of 5. subtilis, E. coli, and K influenzae were aligned. Upon 
comparison of the amino acid sequence encoded by these dnaXg^nes, two areas of 
high homology were used to predict the amino acid sequence of the S. aureus dnaX 
gene product. PCR primers were designed to these sequences, and a PCR product of 
the expected size was indeed produced. DNA primers were designed to two regions 
of high similarity for use in PCR that were about 1 00 residues apart. The amino acid 
sequences of these regions were: 



Upstream, corresponding to residues 39-48 of E. coli (SEQ. E). No. 49) 

His Ala Tyr Leu Phe Ser Gly Pro Arg Gly 
15 10 

Downstream, corresponding to residues 138-148 of £. coli (SEQ. ID. No. 50) 
His Ala Tyr Leu Phe Ser Gly Pro Arg Gly 

The DNA sequence of the PCR primers was based upon the codon usage oiS, 
The primers are as follows: 



Upstream (SEQ. ID. No. 51) 

cgc ggatcc c atgcatattt attttcaggt ccaagagg 



38 
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Downstream (SEQ. ID. No. 52) 

ccg gaattc t ggtggttctt ctaatgtttt taataatgc 39 

The first 9 nucleotides of the upstream primer (SEQ. ID. No. 51) contain a BamHI 
site, which is underlined, and do not correspond to amino acid codons; the 3' 29 
nucleotides correspond to the amino acid sequence of SEQ. ID. No. 49. The EcoRl 
site of the downstream primer (SEQ. ID. No. 52) is underlined and the 3' 33 
nucleotides correspond to the amino acid sequence of SEQ. ID. No. 50. 

The expected PCR product, based on the alignment, is approximately 
268 bp between the primer sequences. Amplification was performed using 500 ng 
genomic DNA, 0.5 mM dNTPs, 1 of each primer, 1 mM MgS04, 2 units vent 
DNA polymerase in 100 pi of vent buffer. Forty cycles were performed using the 
following cycling scheme: 94°C, 1 min; 60^C, 1 min.; 72X, 30s. The approximately 
300 bp product was digested with EcoRI and BamHI and purified in a 0.7 % agarose 
gel. The pure digested fragment was ligated into pUCl 8 which had been digested 
with EcoRI and BamHI and gel purified in a 0.7 % agarose gel. Ligated products 
were transformed into E. coli competent DH5a cells (Stratagene), and colonies were 
screened for the correct chimera by examining minipreps for proper length and correct 
digestion products using EcoRI and BamHI. The sequence of the insert was 
determined and was found to have high homology to the c/noA^ genes of several 
bacteria. This sequence was used to design circular PCR primers. Two new primers 
were designed for circular PCR based on this sequence. 

A circular PCR product of approximately 1 .6 kb was obtained from a 
Hincn digest of chromosomal DNA that was recircularized with ligase. This first 
circular PCR yielded most of the remaining dnaX gene. The two primers were as 
follows: 

Rightward fSEO. ID. No. 53) 

tttgtaaagg cattacgcag gggactaatt cagatgtg 33 



Leftward (SEQ. ID. No. 54) 

tatgacattc attacaaggt tctccatcag tgc 



33 
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Genomic DNA (3 \xg) was digested with HincD, purified with phenol/chloroform 
extraction, ethanol precipitated and redissolved in 70 ^il T.E. buffer. The genomic 
DNA was recircularized upon adding 4000 units T4 Hgase (New England Biolabs) in 
a final volume of 100 ^1 T4 ligase buffer (New England Biolabs) at IC^'C overnight. 
5 The PCR reaction consisted of 90 ng recircularized genomic DNA, 0.5 mM each 

dNTP, 100 pmol of each primer, 1.4 mM magnesium sulfate, and 1 unit of elongase 
(GIBCO) in a final volume of 100 nl elongase buffer (GIBCO). 40 cycles were 
performed using the following scheme: 94°C, 1 min.; 55°C, 1 min.i and 68^C, 2 min. 
The resulting PCR product was approximately 1 .6 kb. The PCR product was purified 

10 from a 0.7 % agarose gel and sequenced directly. A stretch of approximately 750 

nucleotides was obtained using the rightward primer used in the circular PCR 
reaction. To obtain the rest of the sequence, other sequencing primers were designed 
in succession based on the information of each new sequencing run. 

This sequence, when spliced together with the previous 300 bp PCR 

15 sequence, contained the complete N-terminus of the gene product (stop codons are 

present upstream) and possibly lacked only about 50 residues of the C-terminus. The 
amino terminal region of E. colt tau shares what appears to be the most conserved 
region of the gene as this area shares homology with RFC subunit of the human clamp 
loader and with the gene 44 protein of the phage T4 clamp loader. An alignment of 

20 the N-terminal region of the S. aureus tau protein with that of B, subtilis and E. coli is 

shown in Figure 10. Among the highly conserved residues are the ATP binding site 
consensus sequence and the four cystine residues that form a Zn^^ finger. 

After obtaining 1 kb of sequence in the 5' region of dnaX, it was 
sought to determine the remaining 3* end of the gene. Circular PCR products of 

25 approximately 800bps, 600bps, and 1600bps were obtained from Apo I, or Nsi I or 

Ssp I digest of chromosomal DNA that were recircularized with ligase. 

Rightward (SEQ. ID. No. 55) 

gagcactgat gaacttagaa ttagatatg 29 



Leftward (SEQ. ID. No. 56) 

gatactcagt atctttctca gatgttttat tc 



32 
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Genomic DNA (3 g) was digested with, Apo I, or Nsi I or Ssp I, purified with 
phenol/chloroform extraction, ethanol precipitated, and redissolved in 70 1 T.E. buffer. 
The genomic DNA was recircularized upon adding 4000 units of T4 ligase (New 
England Biolabs) in a final volume of 100 1 T4 ligase buffer (New England Biolabs) at 
16°C overnight. The PGR reaction consisted of 90 ng recircularized genomic DNA, 
0.5 mM each dNTP, 100 pmol of each primer, 1 .4 mM magnesium sulfate, and 1 unit 
of elongase (GIBGO) in a final volume of 100 1 elongase buffer (GBCO). 40 cycles 
were perfomied using the following scheme: 94^G, 1 min.; SS'^G, 1 min.; 68°G, 2 min. 
The PCR products were directly cloned into pGR II TOPO vector using the TOPO TA 
cloning kit (Invitrogen Gorporation) for obtaining the rest of the G terminal sequence 
of S. aureus dnaX, DNA sequencing was performed by the Rockefeller University 
sequencing facility. 

Example 16 - Identificatioii and Cloning of ^. aureus dnaB 

In E, coli, the DnaB helicase assembles with the DNA polymerase HI 
holoenzyme to form a replisome assembly. The DnaB helicase also interacts directly 
with the primase to complete the machinery needed to duphcate a double helix. As a 
first step in studying how the S, aureus helicase acts with the replicase and primase, S. 
aureus was examined for presence of si dnaB gene. 

The amino acid sequences of the DnaB helicase of Escherichia coli. 
Salmonella typhimurium, Haemophilis influenzae, and Helicobacter pylori were 
aligned using Glustal W (1.5). Two regions about 200 residues apart showed good 
homology. These peptide sequences were: 

Upstream, corresponding to residues 225-238 ofE. coli DnaB (SEQ. ID. No. 57) 

Asp Leu lie lie Val Ala Ala Arg Pro Ser Met Gly Lys Thr 
15 10 

Downstream, corresponding to residues 435-449 of coli DnaB (SEQ. ID. No. 58) 

Glu He He He Gly Lys Gin Arg Asn Gly Pro He Gly Thr Val 
15 10 15 

The following primers were designed from regions which contained conserved 
sequences using codon preferences for 5, aureus: 
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Upstream (SEQ. ID. No. 59) 

gaccttataa ttgtagctgc acgtccttct atgggaaaaa c 41 

5 Downstream (SEQ. ID. No. 60) 

aacattatta agtcagcatc ttgttctatt gatccagatt caacgaag 48 

A PGR reaction was carried out using 2.5 units of Taq DNA Polymerase (Gibco, 
BRL) with 100 ng. S. aureus genomic DNA as template, 1 mM dNTP's, \\iU of each 

10 primer, 3 mM MgCl2 in 100 ^1 of Taq buffer. Thirty- five cycles of the following 
scheme were repeated: 94'*C, 1 min.; 55°C, 1 min.; and 72°C, 1 min. Two PGR 
products were produced, one was about 1.1 kb, and another was 0.6 kb. The smaller 
one was the size expected. The 0.6 kb product was gel purified and used as a template 
for a second round of PGR as follows. The 0.6 kb PGR product was purified fi-om a 

15 0.8% agarose gel using a Geneclean IE kit (Bio 1 01) and then divided equally into 
five separate aliquots, as a template for PGR reactions. The final PGR product was 
purified using a Quiagen Quiaquick PGR Purification kit, quantitated via optical 
density at 260 nM, and sequenced by the Protein/DNA Technology Genter at 
Rockefeller University. The same primers used for PGR were used to prime the 

20 sequencing reaction. The amino acid sequence was determined by translation of the 
DNA sequence in all three reading fi-ames, and selecting the longest open reading 
frame. The PGR product contained an open reading frame over its entire length. The 
predicted amino acid sequence shares, homology to the amino acid sequences encoded 
by dnaB gene of other organisms. 

25 Additional sequence information was determined using the circular 

PGR technique. Briefly, 5. aureus genomic DNA was digested with various 
endonucleases, then religated with T4 DNA ligase to form circular templates. To 
perform PGR, two primers were designed fi-om the initial sequence. 

30 First primer (SEQ. K). No. 6 1 ) 

gatttgtagt tctggtaatg ttgactcaaa ccgcttaaga accgg 45 



Second primer (SEQ. ED. No. 62) 

atacgtgtgg ttaactgatc agcaacccat ctctagtgag aaaatacc 



48 
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The first primer matches the sequence of the coding strand and the second primer 
matches the sequence of the complementary strand. These two primers are directed 
outwards from a central point, and allow determination of new sequence infoimation 
5 up to the ligated endonuclease site. A PCR product of approximately 900 bases in 

length was produced using the above primers and template derived firom the ligation 
ofS. aureus genomic DNA which had been cut with the restriction endonuclease Apo 
L This PCR product was electrophoresed in a 0.8% agarose gel, eluted with a Qiagen 
gel elution kit, divided into five separate aliquots, and used as a template for 

10 reamplification by PCR using the same primers as described above. The final product 
was electrophoresed in an 0.8% agarose gel, visuahzed via staining with ethidium 
bromide under ultraviolet light, and excised from the gel. The excised gel slice was 
fi^ozen, and centriftiged at 12,000 rpm for 15 minutes. The supernatant was extracted 
with phenol/chloroform to remove ethidium bromide, and was then cleaned using a 

15 Qiagen PCR purification kit. The material was then quantitated fi^om its optical 

density at 260 nm and sequenced by the Protein/DNA Technology Center at the 
Rockefeller University. 

The nucleotide sequence contained an open reading firame over its 
length, up to a sequence which corresponded to the consensus sequence of a cleavage 

20 site of the enzyme Apo 1. Following this point, a second open reading frame encoded 
a different reading fi-ame up to the end of the product. The inital sequence 
information was found to match the inital sequence and to extend it yet further 
towards the C-terminus of the protein. The second reading fi-ame was found to end in 
a sequence which matched the 5*-terminus of the previously determined sequence and, 

25 thus, represents an extension of the sequence towards the N-terminus of the protein. 

Additional sequence information was obtained using the above primers 
and a template generated using S. aureus genomic DNA circularized via ligation with 
T4 ligase following digestion with Cla L The PCR product was generated using 35 
cycles of the following program: denaturation at 94°C for 1 min.; annealing at 55°C 

30 for 1 min.; and extension at 68**C for 3 minutes and 30 s. The PCR products were 
electrophoresed in a 0,8% agarose gel, eluted with a Qiagen gel elution kit, divided 
into five separate aliquots, and used as a template reamplification via PCR with the 
same primers described above. The final product was electrophoresed in an 0.8% 
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agarose gel, visualized via staining with ethidium bromide under ultraviolet light, and 
excised from the gel. The excised gel slice was frozen, and centrifliged at 12,000 rpm 
for 15 min. The supernatant was cleaned using a Qiagen PGR purification kit. The 
material was then quantitated via optical density at 260 nm and sequenced by the 
Protein/DNA Technology Center at Rockefeller University. The open reading frames 
continued past 500 bases. Therefore, the following additional sequencing primers 
were designed from the sequence to obtain further information: 

First primer (SEP. ID. No. 63) 

cgttttaatg catgcttaga aacgatatca g 3;^ 
Second primer f SEP. ID. No. 64) 

cattgctaag caacgttacg gtccaacagg c 31 

The N-terminal and C-terminal nucleotide sequence extensions 
generated using this circular PGR product completed the 5' region of the gene 
(encoding the N-terminus of DnaB); however, a stop codon was not reached in the 3' 
region and, thus, a small amount of sequence is still needed to complete this gene. 

The alignment of the aureus dnaB with E. coli dnaB and the dnaB 
genes of A subtilis and S. typhimurium is shown in Figure 1 1 . 

Example 17 - Identification and Cloning of 5. aureus holB 

The S, aureus holB was identified by searching the S, aureus database 
with the sequences of S. pyogenes 5' subunit. The X aureus holB encodes a 253 
residue protein of about 28 kDa. The holB gene was amplified by PGR using an 
upstream 69-mer primer as follows: 

Upstream Primer (SEQ. ED. No. 65): 

ggataacaat tccccgctag caataatttt gtttaacttt aagaaggaga tatac ccatg 6C 
gatg aacaq 



which contains an Ncol site (underiined), and a downstream 39-mer primer as 
follows: 
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Downstream Primer (SEQ. ED. No. 66): 

aattttaaag gatcc gtgta taatattcta attttcccg 39 

5 which contains a BamHI site (underlined). The PCR product was digested with Ncol 

and BamHI, purified, and ligated into the Ncol and BamHI sites of pETl 1 a to produce 
plasmid pETSaholB. 

Example 18 * Punflcation of 5. aureus 5* 

10 

The pETSaholB plasmid of Example 17 was transformed into E. 
coli BL21(DE3)rec^. A single colony was used to innoculate 2L of LB media 
supplemented with 200 ^ig/ml ampicillin. Cells (2L) were grown at 3TC to 
OD600-0.5 at which point the temperature was lowered to 15°C and 0.5 mM IPTG 

15 was added. After 1 6 hr of induction, cells were collected by centrifugation, 

resuspended in 50 mM Tris-HCl (pH 7.5), 10% sucrose, 1 M NaCl, 30 mM 
spermidine, 5 mM DTT, and 2 mM EDTA. Cells were lysed by two passages through 
a French press (1 5,000 psi), followed by centrifugation at 1 3,000 rpm for 30 min at 
4*^C. Ammonium sulfate (0.3 g/ml) was added to the clarified lysate. The pellet was 

20 backwashed in 30 ml buffer A containing 0.1 M NaCl and 0.24 g/ml ammonium 

sulfate using a Dounce homogenizer, then the pellet was recovered by centrifugation. 
The resulting pellet was resuspended in 20 ml of buffer A and dialyzed against buffer 
A. The dialyzed protein was applied to a 20 ml FFQ Sepharose column equilibrated 
in buffer A and eluted with a 200 ml linear gradient of 0 - 500 mM NaCl in buffer A; 

25 80 fractions were collected. Peak fractions (54 - 75) were combined (72 mg) and 

dialyzed against buffer A. The 5' preparation was aliquoted and stored frozen at - 
80*^C. 

Example 19 - Identification and Cloning of 5. aureus holA 

30 

The S. aureus holA gene was identified by searching the S. aureus 
database with the sequences of E, coli and 5. pyogenes 5 subunits. The 5. aureus holA 
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gene encodes a 288 residue protein of about 32 kDa. The holA gene was amplified by 
PGR using an upstream 28-mer primer as follows: 

Upstream Primer (SEQ. ID. No. 67): 
5 gggagtttgt aat ccatgg a tgaacagc 2 8 

which contains a Ncol site (underlined), and a downstream 37-mer primer as follows: 

Downstream Primer (SEQ. ID. No. 68): 
10 ctgaacacct attac cctag gcatctaact cacaccc 37 

which contains a BamHI site (imderlined). The PGR product was digested with Ncol 
and BamHI, purified, and ligated into the Ncol and BamHI sites of pETl 1 a to produce 
plasmid pETSaholA. 

15 

Example 20 - Purification of aureus 5 

The pETSaholA plasmid of Example 19 was transformed into E. coli 
NovaBlue (recAl \ac[F'proA'B^ lac'^ZAM15::TnlO(Tc^)) (Novagen). A single 

20 colony was used to innoculate 12L of LB media supplemented with 200 |ig/ml 
ampicilHn. Gells (12L) were grown at 'hTQ to OD6oo=0.5 at which point the 
temperature was lowered to 1 S^'C and 0.5 mM EPTG was added. After 1 6 hr of 
induction, cells were collected by centrifugation, resuspended in 50 mM Tris-HGl (pH 
7.5), 10% sucrose, IM NaCl, 30 mM spermidine, 5 mM DTT, and 2 mM EDTA. 

25 Cells were lysed by two passages through a French press (1 5,000 psi), followed by 
centrifugation at 13,000 rpm for 30 min at 4''C. Ammonium sulfate (0.3 g/ml) was 
added to the clarified lysate. The resulting pellet was resuspended in 250 ml of buffer 
A, The dialyzed protein was applied to a 100 ml FFQ Sepharose column equilibrated 
in buffer A and eluted with a 1000 ml linear gradient of 0 - 500 mM NaCl in buffer A; 

30 80 fractions were collected. Peak fractions (40-49) were combined (65 mg) and 
dialyzed against buffer A. The dialyzed protein was applied to a 8 ml MonoQ 
Sepharose column equilibrated in buffer A and eluted with a 80 ml linear gradient of 0 
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- 500 mM NaCl in buffer A; 80 fractions were collected. Peak fractions of the 6 
preparation were stored frozen at -SO^^C. 

Example 21 - Consitution of a Processive 5. aureus DNA Polymerase III Enzyme 
5 from Three Components 

The Pole (alpha-large) requires the p clamp for processivity, which in 
turn requires the clamp loader (t56') for assembly onto DNA. The S, aureus clamp 
loader, x55' complex, was assembled by mixing the three proteins as follows: 400 |ig 

10 oft and 80 |ig each of 6 and 8* were mixed in buffer A containing no NaCl and 
preincubated at 15**C for 10 min. The mixture was injected onto a 1 ml MonoQ 
column equilibrated in buffer A, and then eluted with a 30 ml linear gradient of 0-500 
mM NaCl in buffer A; 60 fractions were collected. Fractions were analyzed in a 10% 
SDS-polyacrylamide gel stained with Coomassie Blue. Peak fractions (40-50) were 

15 combined and concentrated using a Centricon 30 concentrator. 

The ability of the three components to work together to form the 
processive Pol HI was tested by determining whether t55' and p clamp could confer 
the ability of PolC to completely extend a single primer full circle around a large 7.2 
kb circular M13mpl8 ssDNA genome. Replication reaction contained 70 ng (25 

20 finol) on singly primed Ml 3mpl 8 ssDNA, 20 ng S, aureus P, 50 ng S. aureus PolC. 
either 30 ng or 90 ng of S. aureus t65' (when indicated), and 0.82 ]xg ofS, pyogenes 
SSB in 24 lal of 20 mM Tris-HCI (pH 7.5), 4% glycerol, 0.1 mM EDTA, 5 mM DTT, 
2 mM ATP, 8 mM MgCh, 40 \xg/ml BSA, and 60 mM each of dGTP and dCTP. 
Reactions were pre-incubated for 2 min at 37°C to assemble protein complexes on the 

25 primer terminus. DNA synthesis was initiated upon addition of 1.5 |al dATP and ^^P- 
TTP (specific activity 2,000-4,000 cpm/pmol) and synthesis was allowed to proceed 
for 1 min before being quenched with an equal volume (25 |il) of a solution of 1% 
SDS and 40 mM EDTA. One-half of the quenched reaction was analyzed for total 
DNA synthesis using DE8 1 paper as described, and the other half was analyzed by 

30 agarose gel phoresis. An autoradiogram of the agarose gel analysis of the repUcation 
products is depicted in Figure 13, which shows that the presence of PolC and p, but 
absence of t65' (lane 1) gives no frill length circular duplex (RFII). However, in the 
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presence of t85* (lanes 2 and 3), full length circular duplex DNA (RFII) is produced, 
as expected for the action of a processive Pol III holozyme. 

Example 22 - General Induction/Purification Conditions for 5. pyogenes 

5 

The purification protocols for S. pyogenes proteins were performed 
using following standardized conditions. Cells were grown from a single colony, 
freshly transformed overnight. Cells were grown in 200 ng/ml Ampicillin to 
00600=0.3-0.4, at which point cultures were chilled prior to addition of IPTG (to a 

10 final concentration of 0.5 mM) and were allowed to incubate for 16 hrs at 1 5°C. 

Following this, all procedures were performed at 4'*C- Cell paste (1-2 g/liter of 
culture) was resuspended (10 ml/g cell paste) in 50 mM Tris-HCI (pH 7.5)/10% 
Sucrose/1 M NaCl/5 mM DTT/ 30 mM Spermidine/1 X Heat lysis buffer (50 mM 
Tris-HCI (pH 7.5), 1% Sucrose, 100 mM NaCl, 2 mM EDTA). Cells were lysed by 

15 two passages through the French Press (1 5,000 psi) followed by centrifiigation at 

14,000 rpm at 4 °C. Ammonium sulfate, when added to the cleared lysate, was added 
gradually. Precipitate was allowed to settle on ice for a minimum of 30 min prior to 
collection by centrifiigation. Protein pellets were resuspended in buffer A (50 mM 
Tris-HCI pH 7.5, 1 mM EDTA, 5 mM DTT, 10% glycerol) and dialyzed for over 3 

20 hours in the same buffer. Column design is based on the manufacturer's suggested 
capacities: Fast Flow Q (FFQ) and MonoQ are 20 mg protein /ml resin, Heparin- 
Affigel agarose is 1.2 mg protein/ml resin. Elution was performed using 10 column 
volume (c.v.) gradients, and the entire gradient elution profile was collected in 80 
fractions. Unless mentioned otherwise all columns were equilibrated and eluted with 

25 buffer A. 

Example 23 - Identification of a 5. pyogenes holA gene Encoding a Functional 
Delta Subunit and Purification of the Delta Subunit 

30 Alignment oiE. coli delta subunit with 1 0 other putative holA products 

from unfinished genome databases of Gram negative bacteria indicates a region of 
conserved amino acid sequence. Amino acids Q140 to L230 ofE, coli delta were 
used to search the B, subtilis genome database for a Gram positive delta homolog. 
This search revealed yqeN, a potential reading frame of unknown function, as the 
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highest scoring sequence. Although the score was low, it was treated as a candidate 
for Gram positive delta. The alignment with E. coli delta is shown in Figure 12A. A 
Streptococcus pyogenes genome database was searched v^i^yqeN, Two contigs 
which represent N- (contig 206) and C- (contig 264) termini of 5. pyogenes delta 
subunit were identified. The alignment of the putative S. pyogenes holA with B. 
subtilis yqeN \s shov/n in Figure 12B. The following primers were used to obtain 
PCR products for delta subunit: 

holA Upstream (SEQ. ID No. 69) 

ggagcagatt gcttttgata catatgattg gcctattc 33 
holA Downstream (SEQ. ID No. 70) 

ttgtctccgc atcaaactgg gatccaagag catcatacgc gtatgg 45 

These primers were used to amplify the hoIA gene from pyogenes genomic DNA. 
The PCR product was digested with Ndel and BamHI, purified and ligated into the 
pETl la vector to produce pETl la.S.p. holA. 

The pETI la.S.p. hoi A plasmid was transformed into the 
BL21(DE3)RecA- strain oiE, coli, A single colony from an overnight transformation 
was used to innoculate 12L LB broth supplemented with 200 jig/ml Ampicillin. Cells 
were grovm at 37'C to OD600=0.5, at which point the temperature was lowered to 
1 5"C and 0.5 mM IPTG was added. Induction proceeded for 16 hrs. In the morning, 
cells were collected by centrifiigation and resuspended in 50 mM Tris-HCl (pH 7.5)/ 
10% Sucrose /IX Heat Lysis Buffer/I M NaCl/30 mM Spermidine/5 mM DTT. Cells 
were lysed by two passages through the French press (15,000 psi), followed by 
centrifiigation at 13,000 rpm for 30 min. The supernatant was decanted and 
ammonium sulfate was added to a final concentration of 0.226 g/ml. The resulting 
pellet was collected by centrifiigation and resuspended in 20 ml of buffer A. The 
resuspended pellet was dialyzed against buffer A containing no salt. The dialyzed 
protein (500 mg) was loaded onto a FFQ- Sepharose (35 ml) column and eluted with a 
linear gradient from 0 - 500 mM NaCl ( 10 c.v.). The peak fractions (21-45) were 
combined and dialyzed against buffer A (0 NaCI) for 3 hrs, then diluted to a 
conductivity of 50 mM NaCl and loaded (160 mg) onto a 120 ml Heparin- Affigel 
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10 



column. Protein was eluted with a linear gradient of 0-500 mM NaCl (10 c.v.). The 
fractions containing the least contaminants (39-51) were precipitated with ammonium 
-sulfate (0.226 g), collected by centrifugation, resuspended 5 ml of buffer A, and 
dialyzed in buffer A containing 200 mM NaCl, The delta subunit was stored at - 
80°C. The final delta subunit preparation is shown in the lane marked 6 of the 
Coomassie Blue stained SDS-polyacryl amide gel of Figure 14. Yield = 65 mg. 



Example 24 - Identification of 5. pyogenes holB Encoding Delta Prime and 
Purification of the Delta Prime Subunit 



A search of the S, pyogenes genome database with the predicted B, 
subtilis delta prime amino acid sequence revealed a DNA sequence in contig #209 
(previously known as contig #210) that predicted a high scoring match for a gene 
encoding a delta prime protein. The following primers were used to obtain PGR 
15 products for holB\ 

holB Upstream (SEQ. ID. No. 71) 

gcctaggata agggagggta catatggatt tagcgc 36 

20 holB Downstream (SEP. ID. No. 72) 

cgggcaagtc ttttgacaag cttcggatcc ccataacgaa ttcc 44 

The PGR product obtained from these primers was digested with Ndel and BamHI, 
purified and ligated into the pETl la vector to produce pETl la.S.p. holB. 

25 The pETl la.S.p.holB plasmid was transformed into the 

BL21(DE3)RecA- strain of £. colL A single colony from an overnight transformation 
was used to innoculate 12L LB broth supplemented with 200 ng/ml Ampicillin. Gells 
were grown at 37**C to O.D. 600=0.4, at which point the temperature was lowered to 
1 S^'G and 0.5 mM ffTG was added. Induction proceeded for 16 hrs. In the morning, 

30 cells were collected by centrifugation and resuspended in 100 ml 50 mM Tris-HGl 
(pH 7.5)/ 10% Sucrose /IX Heat Lysis Buffer. Lysis was initiated upon addition of 
0.4 mg/ml lysozyme followed by a 1 hr incubation on ice. Lysate was clarified by 
centrifugation at 13,000 rpm for 30 min. Anmionixim sulfate was added to the 
supernatant to a final concentration of 0.3 g/ml. The protein pellet was resuspended in 
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buffer A(0.1 M NaCl) + 0.24 g/ml ammonium sulfate and clarified by centrifiigation. 
The resulting protein pellet was resuspended in 20 ml of buffer A and dialyzed against 
buffer A. The dialyzed protein (450 mg) was loaded onto a 30 ml FFQ- Sepharose 
column and eluted with a linear gradient from 0 - 500 mM NaCl. The peak fractions 
5 were combined (fi* 20-30 containing 130 mg) and dialyzed against buffer A and 

loaded (70 mg) onto a 50 ml Heparin- Affigel column. Protein was eluted with a 
linear gradient of 0-500 mM NaCl. Delta prime binds weakly to both resins and elutes 
in the beginning of the gradient. This delta prime subunit was stored frozen at - SO'^C. 
The final delta prime subunit preparation is shown in lane marked 6' of the Coomassie 
10 Blue stained SDS-polyacrylamide gel of Figure 14. Yield = 40 mg. 

Example 25 - Identification of the 5. pyogenes dnaX Gene and Purification of the 
Tau Subunit 

15 A search of the S. pyogenes genome database with the putative B, 

subtilis tau amino acid sequence revealed a DNA sequence in contig #284 (previously 
known as contig # 289) with a high scoring match which predicted a gene encoding 
for a tau subunit protein. A set of PCR primers to 5'- and 3'- termini of the putative 
gene sequence were designed to include restriction enzyme recognition sequences for 

20 Ndel and BamHI sites, respectively. These primers are: 

c/^zoA^Upstream (SEQ. ID. No. 73) 

ggagttaaaa acatatgtat caagctcttt ate 33 

25 t//;aX Downstream (SEQ. ID. No. 74) 

cgtgggtaag ggcaaaacgg atcccttatg tatttcag 38 

A PCR product obtained with the above primers was digested with Ndel and BamHI, 
purified and ligated into pETl la vector to produce pETl la.S.p.dnaX. 
30 The pETl la.S.p.dnaX plasmid was transformed into the 

BL21(DE3)RecA- strain ofE, colL A single colony from an overnight transformation 
was used to innoculate 24L LB broth supplemented with 200 ng/ml Ampicillin. Cells 
were grown at 37°C to O.D.600=0.5, at which point the temperature was lowered to 
15°C and 0.5 mM IPTG was added. Induction proceeded for 16 hrs. In the morning. 
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cells were collected by centrifiigation and resuspended in 200 mis of 50 mM Tris-HCl 
(pH 7.5)/ 10% Sucrose /IX Heat Lysis Buffer/IM NaCl/30 mM Spermidine/5 mM 
DTT/5 mM EDTA. Cells were lysed by two passages through the French press 
(15,000 psi), followed by centrifiigation at 13,000 rpm for 30 min. The supernatant 
(2.4 gm) was dialyzed against buffer A containing 50 mM NaCl, loaded onto a 1 20 ml 
FFQ column (without ammonium sulfate precipitation) and eluted with a linear 
gradient of 100-700 mM NaCL The peak fractions (fi^ 41-55) were combined, 
diluted with buffer A containing no salt (a dilution of 1/5) to a conductivity of 100 
mM NaCl, loaded (310 mg) onto a 300 ml Heparin- Affi gel column, and eluted with a 
linear gradient of 100-500 mM NaCl. The peak fi-actions (&# 21-36) were combined, 
dialyzed against buffer A, loaded (87 mg) onto 1 0 ml FFQ column, and eluted as 
described for the first FFQ column. The peak fractions (fT# 27-41) were concentrated 
by centrifiigation in Centriprep 30 filtration unit and frozen at -SO^'C. The fmal tau 
subunit preparation is shown in the lane marked t of the Coomassie Blue stained SDS- 
polyacrylamide gel of Figure 14. Yield - 103 mg. 

Example 26 - IdeDtification of the S, pyogenes dnaN Gene and Purification of the 
Beta Subunit 

A search of the S. pyogenes genome database with the putative 5. 
subiilis beta subunit amino acid sequence revealed a DNA sequence (contig # 266) 
with a high scoring match which predicted a gene encoding for a beta subunit protein. 
A set of PCR primers to 5'- and 3'- termini of the putative gene sequence were 
designed to include restriction enzyme recognition sequences for Ndel and BamHI, 
respectively. The primers were: 

Upstream (SEQ. E). No. 75) 
ggagttcata tgattcaatt ttcaaattaa tcgc 34 

cfag// Downstream (SEQ. ID. No. 76) 

tatcagctcc tggatccagt accttccatt gattagcc 39 

A PCR product obtained with these primers was digested with Ndel and BamHI, 
purified and ligated into pET16b vector to produce pET16b.S.p.dnaN. 
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The pET16b.S.p,dnaN plasmid was transfonned into the 
BL21(DE3)RecA- strain of E. coli, A single colony from an overnight transformation 
was used to innoculate 15L LB broth supplemented with 200 ^g/ml Ampicillin. Cells 
were grown at 3T'C to O.D.600=0.4, at which the point temperature was lowered to 
5 IS^'C and 0.5 mM IPTG was added. Induction proceeded for 16 hrs. In the morning, 

cells were collected by centrifugation and resuspended in 100 ml 50 mM Tris-HCl 
(pH 7.5)/ 10% Sucrose /IX Heat Lysis BufTer/1 M NaCy5 mM DTT/ 30 mM 
Spermidine/5 mM EDTA. Cells were lysed by two passages through the French press 
(15,000 psi), followed by centrifugation at 13,000 rpm for 30 min. Ammonium 

10 sulfate was added to the supernatant to a final concentration of 0.3 g/ml. The resulting 
protein pellet was resuspended and dialyzed against buffer A containing 50 mM NaCl. 
The dialyzed protein (300 mg) was loaded onto a 45 ml FFQ- Sepharose column and 
eluted with a linear gradient from 50 - 500 mM NaCl. The peak fractions (16-30) were 
combined, dialyzed against buffer A containing 50 mM NaCl, loaded onto a 25 ml 

15 EAH-Sepharose column, and eluted with a hnear gradient of 50-500 mM NaCl. The 

fractions containing the least contaminants were combined into two pools (pool 1 10- 
17, pool n 19-27). Each pool was fiirther purified on a 8 ml MonoQ column 
(perfomied under conditions described for the FFQ column above). The final beta 
subunit preparation is shown in the lane marked p of the Coomassie Blue stained 

20 SDS-polyacrylamide gel of Figure 14. Yield = 48 mg. 

Example 27 - Identification of the S. pyogenes polC Gene and Purification of the 
Alpha-Large Polymerase Subunit 

25 A search of the B, subtilis genome database with the E. coli alpha 

subunit amino acid sequence revealed two DNA sequences with a high scoring match 
which predicted two genes encoding alpha-like polymerase subunits. The DNA 
sequence with the second highest scoring match which encoded the largest of the two 
polymerase subunits also appeared to encode for the epsilon exonuclease domain at 

30 the N- terminus of the putative alpha subunit. A search of the B. subtilis genome 
database with S, pyogenes DNA sequence confirmed this nucleotide sequence to 
encode the Gram positive homolog of the E. coli replicative polymerase subunit 
(alpha). This Gram negative alpha-like subunit lacked homology to epsilon. The 
gene encoding the large alpha polypeptide sequence (alpha-large) will be referred to as 
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the product of the polC gene and the gene encoding the smaller Gram-negative alpha- 
like polymerase (alpha-small) will be referred to as the product of the polE or dnaE 
gene (see Example 28). 

The alpha-large polymerase polypeptide is a product of two 
5 overlapping contigs; contig #197 (renamed #193) encodes the N-terminal 630 amino 
acids, and contig #278 (renamed #273) encodes the C-terminal 1392 amino acids. The 
putative Open Reading Frame generates a 1464 amino acid polypeptide (SEQ. ID. 
No. 18). Since the polC nucleotide sequence contained several Ndel sites, a primer 
was designed to mutate two restriction endonuclease sites in the pETl la nucleotide 

10 sequence upstream of the N-terminus of the gene; an Xbal restriction site was mutated 

to an Nhel restriction site and an Ndel restriction site at the starting ATG was 
removed. A 74mer primer which spans from mutated Xbal site upstream of T7 
promoter includes Nhel site, rbs site (ribosome binding site), mutated Ndel site and 
first 10 amino acid codons of polC gene sequence. The following primers were used 

15 in a PGR reaction to amplify polC gene from 5. pyogenes genomic DNA: 

po/CUpstream (SEQ. ID. No. 77) 

ggataacaat tccccgctag caataatttt gtttaacttt aagaaggaga tatacccatg 60 
tcagatttat tcgc 74 

20 

polC Downstream (SEQ. ID. No. 78) 

cggtgtctct atctaaatga ctcatttggg atcctcgctt tatacggtat gtcacag 57 

Elongase (BRL) produced the best amplification results. PGR reaction conditions 
25 were: 5 |ig genomic DNA, 20 ng of each primer, 1 ml Elongase, 60 fiM each dNTP, 
in 100 ml Elongase reaction buffer for 1 min at 94T, 1 min at 55**C, and 6 min at 
eO'^C repeated for 40 cycles. The resulting 4000 bp PGR fragment was digested with 
Nhel and BamHI, purified and ligated into the pETl la vector (digested with Xbal and 
BamHI) to produce pETl la.S.p.polC. 
30 The pETl 1 a.S.p.polC plasmid was transformed into the 

BL21(DE3)RecA- strain of £ colt, A single colony from an overnight transformation 
was used to innoculate 24L LB broth supplemented with 200 ng/ml Ampicillin. Cells 
were grown at 37**C to OD600=0.4 at which point temperature was lowered to IS^'C 
and 0.5 mM IPTG was added. Induction proceeded for 16 hrs. In the morning, cells 
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(12g) were collected by centrifiigation and resuspended in 100 ml 50 mM Tris-HCl 
(pH 7.5)/ 10% Sucrose /IX Heat Lysis Buflfer/l M NaCl/5mM DTT/30 mM 
Spermidine/5 mM EDTA. Cells were lysed by two passages through the French press 
(1 5,000 psi), followed by centrifiigation at 13,000 rpm for 30 min. Ammonium 
sulfate was added to the supernatant to a final concentration of 0.226 g/ml. The 
precipitate was collected by centrifiigation. The protein pellet (220 mg resuspended in 
buffer A) was dialyzed against buffer A containing 150 mM NaCl, loaded onto an 8 
ml FFQ colunin equilibrated with buffer A containing 150 mM NaCI, and eluted with 
a linear gradient of buffer A containing 150-600mM NaCl. The fi-actions containing 
the least contaminants (M 42-64) were combined and precipitated with ammonium 
sulfate (0.226 g/ml). The precipitate was collected by centrifiigation and resuspended 
in buffer A (10 mg/ml in 5 ml). A fraction (1 ml=10mgs) of the concentrated protein 
was dialyzed, loaded onto 10 ml ssDNA-agarose column, and eluted with a linear 
gradient of 50-500 mM NaCl. The peak fractions {M 30-50) were combined and 
concentrated with ammonium sulfate (as above). The final alpha-large subunit 
preparation is shown in lane marked ol of the Coomassie Blue stained SDS- 
polyacrylamide gel of Figure 14. Yield= 4 mgs. 

Example 28 - IdentificatioD of the S. pyogenes dnaE Gene and Purification of the 
Alpha-Small Polymerase 

A search of the 5. subtilis genome database using the E. coli alpha 
subunit amino acid sequence revealed two DNA sequences with a high scoring match 
which predicted two genes encoding for alpha-like polymerase subunits. The DNA 
sequence with the highest scoring match encodes a smaller alpha polymerase which 
does not contain an exonuclease domain. The putative short alpha DNA sequence is 
a product of the open reading frame in contig #253 of the S, pyogenes genome 
database. A set of PCR primers to 5'- and 3 '-termini of the putative gene sequence 
were designed to include restriction enzyme recognition sequences for Ndel and 
BamHI, respectively. The primers were: 



g -short Upstream (SEQ. ID. No. 79) 

gggaacaaga taaccaagga ggaacccatg gttgctcaac ttg 



43 
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g -short Downstream (SEQ. ID. No. 80) 

cgaatagcag cgttcatacc aggatcctcg ccgccactgg 40 

A PCR product obtained with these primers was digested with Ndel and BamHI, 
5 purified and hgated into pETl 1 a vector to produce pETl 1 a.S.p.dnaE. 

The pETna.S.p.dnaE plasmid was transformed into the 
BL21(DE3)RecA- strain of E. coli. A single colony from an overnight transfomation 
was used to innoculate 12L LB broth supplimented with 200 \ig/m\ Ampiciliin. Cells 
were grown at 3TC to OD600^0.4, at which point temperature was lowered to 1 5**C 

10 and 0.5 mM IPTG was added. Induction proceeded for 16 hrs. In the morning, cells 

were collected by centrifiigation and resuspended in 1 00 mis 50 mM Tris-HCl (pH 
7.5)/ 10% Sucrose /IX Heat Lysis Buffer/5 mM DTT/30 mM Spermidine/IM NaCl/5 
mM EDTA. Cells were lysed by two passages through the French press (1 5,000 psi), 
followed by centrifiigation at 13,000 rpm for 30 min. Ammonium sulfate was added 

15 to the supernatant to a final concentration of 0.226 g/ml. The precipitate was 

collected by centrifugalion. The protein pellet (resuspended in buffer A) was then 
dialyzed against buffer A. The dialyzed protein (600 mg) was loaded onto a 30 ml 
FFQ and eluted with a linear gradient of buffer A containing 50-500 mM NaCl. The 
peak fractions (200 mg in fr # 70-79) were dialyzed and loaded onto a 100 ml 

20 Heparin- Affi gel column. The fractions containing the least contaminants (100 mg 

from fr # 18-30) were pooled and dialyzed against buffer A containing 300 mM NaCl. 
The dialysate (50 mg) was loaded onto a 50 ml ssDNA-agarose colunm and eluted 
with a linear gradient of 300mM - IM NaCl The final alpha-small subunit 
preparation is shown in lane marked of the Coomassie Blue stained SDS- 

25 polyacrylamide gel of Figure 14. Yield = 25 mg. 

Example 29 - Identification of the 5. pyogenes ssb Gene and Purification of the 
Single Strand DNA-Binding Protein 

30 Search of the S. pyogenes genome using the B. subtilis SSB amino acid 

sequence identified a polypeptide in contig #230(212) as having highest homology to 
single strand binding protein of several Gram negative bacteria. This contig lacked the 
first 26 amino acids at the N-terminus. Circular PCR was employed to identify the 
DNA encoding the N-terminus of the putative SSB protein. S, pyogenes genomic 
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DNA was digested overnight with Apol (5 \ig chromosomal DNA in a 50 nl reaction). 
The DNA was extracted with phenol and precipitated with ethanol. The Apol 
digested chromosomal DNA was self-ligated to generate circular template for future 
use in the circular PGR. A circular PGR was performed with primers designed to 
5 anneal back-to-back to amplify circularized Apol reaction fragments. The primers 
were; 

ssb.circ Upstream (SEQ. ID. No. 81) 

accattttgg cttttaaagg tacggttaac agcaagtgtg aaggtagcc 4 9 

10 

ssb.circ Downstream (SEQ. ID. No. 82) 

gaacgcgagg cagatttcat taactgtgtg atctggcg 3 8 

The PGR reaction conditions were as follows: 100 ng circularized 5'. pyogenes 
15 genomic DNA, 20 ng each primer, 1 ml Elongase, 60 ^iM each dNTP, 1 00 1 

Elongase reaction buffer. Amplification was performed for 40 cycles as follows: 
denature, 1 min at 94**C; anneal, 1 min at SS'^G; and extend, 5 min at 68^G. PGR 
products were cloned into the Topo TA vector following instructions of the 
manufacturer (Promega). Several positive clones were sequenced to obtain N- 
20 terminal nucleotide sequence. This information lead to design of the following 
primers with which the use of a standard PGR reaction generated whole ssb gene 
products. The primers were: 

ssb Upstream (SEQ. ID. No. 83) 
25 tttaaaagag ggtagcatat gattaataat gtagtactag ttggtcgc 48 

ssb Dovynstrcam (SEQ. ID. No. 84) 

tttaaattta aacctaggtt caatccattc tgactagaat ggaagatcgt c 51 

30 The resulting PGR product was digested with Ndel and BamHI, purified and ligated 
into pETl la vector to produce pETl 1 a.S.p. ssb. 

The pETl la.S.p.ssb plasmid was transformed into the 
BL21 (DE3)RecA- strain of £. colL A single colony from an overnight transformation 
was used to innoculate 12L LB broth supplemented with 200 ]ig/m\ Ampicillin. Gells 
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were grown at 37"C to OD600=0.5, at which point 0.5 mM IPTG was added. At the 
end of the 3 hr induction, cells were collected by centrifugation and resuspended in 
100 ml of 50 mM Tris-HCl (pH 7.5)/ 10% Sucrose /IX Heat Lysis Buffer/5 mM 
DTT/5 mM EDTA. The cell lysis was initiated upon addition of 0.4 mg/ml lysozyme 
followed by a 1 hr incubation on ice. The lysate was clarified by centrifugation at 
1 3,000 rpm for 30 min. The SSB protein was significantly purified by sequential 
fractionation with ammonium sulfate in the following manner. Solid ammonium 
sulfate was added to the clarified lysate to a final concentration of 0.24 g/ml and the 
precipitated protein was collected by centrifugation at 13,000 rpm for 30 min. The 
resulting pellet was homogenized in buffer A(0.1 M NaCl) + 0.24 g/ml ammonium 
sulfate and the precipitate was collected by centrifugation. This procedure was 
repeated with buffer A(0.1 M NaCl) + 0.2 g/ml ammonium sulfate, buffer A(0.1 M 
NaCl + 0. 1 5 g/ml ammonium sulfate, and buffer A(0. 1 M NaCl) + 0. 1 3 g/ml 
ammonium sulfate. The final pellet was resuspended in buffer A + 0.15 M NaCl and 
dialyzed against the same buffer. The resulting pellet was resuspended in buffer A 
and dialyzed against buffer A containing 500 mM NaCl. The dialysate (300 mg) was 
diluted to 0.15 M NaCl before it was loaded onto a 20 ml MonoQ column and eluted 
with a Hnear gradient of 0. 15 M - 0.5 M NaCl in buffer A. The SSB protein elutes in 
the very beginning of the gradient. The peak fractions were combined (1 50 mg in 
fractions 16-30), diluted to 0.05 M NaCl, loaded onto a 10 ml ssDNA-agarose 
column, and eluted with 0.5 M NaCl. The peak fractions (32-62) were combined and 
frozen. The SSB was further purified over a MonoQ column to remove contaminating 
polymerase activity. The final single strand DNA binding protein preparation is shown 
in lane marked ssb of the Coomassie Blue stained SDS-polyacrylamide gel of Figure 
14. Yield =] 20 mg. 

Example 30 - First Demonstration that S. pyogene hoU Encodes a Delta Subunit 
Involved In Replication: Assembly of t56' Complex 

Gel filtration is a standard analytical technique to demonsfrate direct 
protein-protein interaction. Purified x, 6. 6' proteins were used to examine whether 
they form a protein complex assembly. Gel filtration oft mixed with either 5. 5',or 
both 5 and 6' was performed using an HR 10/30 Superose 6 column equilibrated with 
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buffer A containing 100 mM NaCl. Either 6 (200 ng), 6' (200 ^g), or a mixture 
of 5 and 6' (200 ^g each) was incubated for 30 min at 15°C in 100 ^1 of buffer A 
containing 100 niM NaCl, and the entire mixture was injected onto the column. The 
mixture was resolved on the colunm by collection of 170 ^1 fractions after the initial 
5 void (6.6 nl) volume was collected. Fractions were analyzed by 10% SDS- 
polyacrylamide gels (30 ^il/lane) stained with Coomassie Blue. 

The results, in Figure 15, demonstrate that under these conditions 
the T protein exhibits no (weak) interaction with the delta (Figure 1 5B) and the delta 
prime subunits (Figure 1 5C) individually, and yet assembles readily into a complex 

10 when all the subunits are mixed in the reaction (Figure 1 5 A). The t protein was 

mixed wnth a 2-fold molar excess of each 5 and i\ then gel filtered. A complex 
of t55* was formed as demonstrated by coellution of 6 and 5* with t (fr# 22-30) whereas 
excess 56' complex elutes in later fractions (fr#38-46). To determine whether 
individual 6 or 5' subunits interact with x, the x subunit was mixed with either 5 or 6* 

15 and then gel filtered. The results demonstrate that a gel filterable complex does not 

form when t is mixed with 5 (Figure 15B) or 5' (Figure 1 5C) subunits individually, as 
indicated by the absence of these subunits in the x containing fractions (fr#20-26). 
Therefore, it appears that the presence of both 6 and 5' subunits is essential for the 
formation of the x65' complex. 

20 

Example 31 - Second Demonstration ih^t S. pyogenes holA Encodes Delta: 
Functional Assembly of P on DNA 

Gel filtration was used to demonstrate that the x, 5, 6* proteins form a 
25 functional clamp loading complex which is able to load the p clamp onto a circular 
DNA molecule. The reaction contained 0.5 pmol of gp2 nicked pBluescript plasmid 
(a circular double strand plasmid with a single nick produced by Ml 3 gp2 protein), 1 
pmol [^2p]p^ 0.5 pmol x55' complex, 0.25 pmol of either 6, 5*, x were used in 
individual experiments when a subassembly of the complex was tested (x5, x6*, hS) in 
30 75 nl buffer B (20 mM Tris-HCl (pH 7.5), 20 % glycerol, 0.1 mM EDTA, 5 mM 

DTT, 2 mM ATP, 8 mM MgCl2). P was incubated with nicked DNA for 10 min at 
37''C either alone, or in combination with various assemblies of the x complex. All gel 
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filtration experiments were performed at 4°C. The reaction mixtures were applied to a 
5 ml column of Bio-Gel 15M (Bio-Rad) equilibrated in buffer B containing 100 mM 
NaCl. Fractions of 170 |il were collected and quantitated in the Scintillation counter. 

The results, in Figure 16, demonstrate thai the assembly of the ring 
5 onto a circular DNA molecule requires the presence of t. 5, and 6* proteins 

(Figure 16A). In absence of any one of the subunits, loading onto DNA does not 
occur (Figure 16B-E). The clamp loader complex (t56') can be supplied as a mixture 
oft, 6, 5* subunits or as an assembled complex (purified fi-om unassembled subunits by 
gel filtration, or by ion exchange chromatography on MonoQ). Proteins bound to the 
10 large DNA molecule elute in the early fi-actions (void fir# 10-17) and resolve firom ft-ee 

proteins that elute in later firactions (fi^ 18-35). 

Example 32 - The t Subunit Product of the dnaX Gene Binds a -large 

15 The interaction of 5. pyogenes a and t proteins was examined by 

analyzing a mixture of the proteins by gel filtration. Gel filtration oft, a -large or a 
mixture of a-large and x was perfonned using an HR 10/30 Superose 6 colunm 
equilibrated with buffer A containing 100 mM NaCl. Either a-large (400 ^g) (200 
^M) or a mixture of a-large and t was incubated for 30 min at 15**C in 100 \i\ of 

20 buffer A containing 100 mM NaCl, and the entire mixture was injected onto the 

column. The mixture was resolved on the column by collection of 170 \i\ fi*actions 
after the initial void (6.6 ml) volume was collected. Fractions were analyzed by 10% 
SDS-polyacrylamide gels (30 jil/lane) stained with Coomassie Blue. 

The results show a complex of at'^ was formed as demonstrated by 

25 coellution of a-large and t (fi^ 30-38) proteins (Figure 17A) compared to the elution 
profile of individual proteins (Figure 17B-C). Also, the migration of the t in the a\x 
complex changes significantly to a larger complex (4 firactions, fi-om fi^ 37 tafi:# 33). 

Example 33 - Formation of ocyj^bb' Complex 

30 

To determine whether a aLT65' complex could form, the following 
components were mixed: a -large (400 \ig, 2.5 nmol), t (200 [ig, 1.3 nmol), 5 (200 jig. 
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4.8 nmol), 5' (200 ^g, 5.75 pmol) in a final volume of 150 pi. The mixture was diluted 
to 300 ml with buffer A to lower conductivity of the sample to that equivalent of 100 
mM NaCl and incubated for 30 min at 15"C. The mixture was injected onto a 
Superose 6 column (equilibrated with buffer A containing 100 mM NaCl) and 
fractions (170 were collected after an initial 6.6 ml of void volume was collected. 
Fractions were analyzed by 10% SDS-polyacrylamide gels (30 ^1/lane) stained with 
Coomassie Blue. 

A gel filterable complex (Figure 18A) of ultSS' was formed as 
demonstrated by coellution of t, 6 and 5' with a -large (fr# 14-26), whereas 
excess 66' complex elutes in later fractions (fi* 30-38). The migration of 
the t66* protein complex in the qltSS* complex does not change significantly. The 
complex might dissociate under the nonequilibrium conditions of gel filtration due to 
low concentration of proteins, salt concentration and speed of resolution. 

Next, ion exchange chromatography was used to analyze the protein 
mixture to prepare the reconstituted aLT65* complex of 5. pyogenes. The ajj&6' 
complex was reconstituted upon mixing a -large (10 mg, 62 nmol), t (6 mg, 72 nmol), 
6 (3.3 mg, 80 nmol), 6' (1.6 mg, 90 nmol). The a, t, 5, 5* protein mixture was dialyzed 
for 2 hrs against buffer A containing 50 mM NaCI. The entire mixture was loaded 
onto a 1 ml MonoQ column equilibrated in buffer A containing 50 mM NaCl. Proteins 
were eluted with a 20 column volume linear gradient of 50-500 mM NaCl in buffer A 
and 0. 25 ml fractions were collected. Fractions were analyzed by 10% SDS- 
polyacrylamide gels (20 nl/lane) stained with Coomassie Blue. 

Generally, the reconstitution of the aLT55' complex on a MonoQ 
column results in a tight salt resistant complex (Figure 18B, fr# 23-35) which elutes at 
500 mM NaCL The high concentration of the proteins in the eluted fractions 
contributes to stability of the complex. 

Example 34 - The S. pyogenes Three Component Pol III-L Polymerase Is Rapid 
and Processive In DNA Synthesis 

It was previously demonstrated (i.e., in Examples 29 and 30) that the 
putative delta subunit plays an integral part in the assembly of the t55' complex 
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(Figure 15) and that this complex is sufficient to assemble p clamps onto circular 
primed DNA (Figure 16). It was also shown that the strong interaction between the a - 
large and x subunits (Figure 17) results in an isolatable aixbb' complex (Figure 18), 
similar to that of the E. coli DNA polymerase HI*. 
5 The MonoQ fractions containing a\xhh' complex were then used to 

assemble p onto primed DNA and determine whether this now resulted in rapid and 
processive DNA synthesis. Replication reactions contained 70 ng of singly primed 
M13mpl8 ssDNA and 0.82 ^g of 5. pyogenes SSB in 25 nl buffer C (20 mM Tris- 
HCl (pH 7.5), 4 % glycerol, 0.1 mM EDTA, 5 mM DTT, 2 mM ATP, 8 mM MgCl2) 
10 with 60 jiM each of dGTP, dCTP, and dATP, 30 ^M cold TTP and 20 ]iM [a-32P] 
TTP (specific activity of 2,000-4,000 cpm/pmol). The complex is assembled onto 
DNA in the following manner: 40 ng (3:1) or 140 ng (10:1) of the au^bb' complex and 

60 ng of p protein were preincubated for 2 min at 30°C in presence of SSB coated 
primed Ml 3 DNA and two nucleotides (dCTP and dGTP). Reactions were initiated by 

15 addition of the two remaining nucleotides dATP and TTP and quenched with an equal 
volume of 1% SDS/40 mM EDTA. Each time point is a separate reaction. 

A time course of replication on singly primed circular Ml 3mp 18 
ssDNA is shown in Figure 19. The agarose gel analysis shows conversion of the 
oligonucleotide primed single stranded DNA to the slower migrating replicative form 

20 n. The fact that the speed of synthesis is independent of the concentration of 

polymerase in the reaction indicates that the a\x^h' complex synthesizes DNA in a 
rapid and a highly processive manner. The S, pyogenes aixbS complex in presence of 

the p clamp, completely rephcates (is able to complete replication of) 7250 nt of 
M13mpl8 ssDNA in 8-9 sec. 

25 

Example 35 - The S. pyogenes DnaE (a-small) Forms a Three-Component 
Polymerase with t55' and p 

The S. pyogenes DnaE (a-small) polymerase is more homologous to E, coli a 
30 than S, pyogenes PolC. Thus, it seems reasonable to expect that the DnaE polymerase 
may also function with the p clamp (Figs. 21 A-B). To test DnaE for function with 
t55' and p, replication reactions contained 70 ng (25 fmol) of 30-mer singly primed 
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M13mpl8 ssDNA, 0.82 \ig of S. pyogenes SSB, and 3.3 ng - 300 ng of DnaE (25 fmol 
- 2.3 pmol) in 23.5 ^1 of 20 mM Tris-HCl (pH 7.5), 4% glycerol, 0.1 mM EDTA, 5 
mM dithiothreitol (DTT), 40 fig/ml BSA, 2 mM ATP, 8 mM MgCl,, and 60 nM each 
of dGTP and dCTP. When present, reactions included 43.3 ng of p and 10 ng of t55'. 
Reactions were preincubated for 3 min at 37°C, and then NaCI was added to 40 mM 
followed by another 2 min at 37^C. DNA synthesis was initiated upon addition of 1 .5 
III of 1 .5 mM dATP, 0,5 mM [a^^P]-dTTP (specific activity 2,000-4,000 cpm/pmol). 
Aliquots of 25 ^l were removed at the indicated times and quenched with an equal 
volume (25 ^l) of 1% SDS, 40 mM EDTA. One-half of the quenched reaction was 
analyzed for total deoxynucleotide incorporation using DE81 filter paper and the other 
half was analyzed on a 0.8% neutral agarose gel. The effect of TMAU was also 
examined, in which lOO^M TMAU in DMSO (2% DMSO final concentration) was 
present. In this case, replication was allowed to proceed for 1 min before being 
quenched with 25 ^1 of 1% SDS, 40 mM EDTA. 

At a saturating concentration of DnaE polymerase, the time course of primer 
extension shows that it completes an M13mpl8 primed ssDNA template within 2 
minutes for a speed of at least 60 nucleotides/s (Fig. 21 C). This rate of synthesis 
holds true for the highest amount of DnaE in the rightmost panel of the figure. As the 
DnaE concentration is decreased, a longer time is required to complete the circular 
template, indicating that the DnaE polymerase is not processive over the entire length 
of the M 1 3mp 1 8 template. If the DnaE polymerase were fully processive during 
synthesis of the 7.2 kb ssDNA circle, the product profile over time would be 
qualitatively similar at all concentrations of enzyme, but the overall intensity of the 
profile would be diminished. This particular experiment was performed in the 
absence of p, but presence of t55'. When repeated in the presence of p but without 
t55', and in the absence of both p and t58*, results similar to those shown in Fig. 21C 
were observed. 

In the presence of p and t58', DnaE polymerase is stimulated in synthesis at 
low concentration, indicating that P increases the processivity and/or speed of DnaE 
(Figs. 21C-D). At higher concentrations of DnaE, the presence of P/t68' has no effect 
on the rate of synthesis, and thus p does not increase the intrinsic speed of the enzyme 
(i.e., panels 3 and 4 of Fig. 2 ID). Hence, the effect of the p clamp on DnaE is 
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primarily due to an increase in processivity. The profile of product length over time 
remains essentially unchanged at the different DnaE concentrations, and therefore the 
processivity of DnaE, with P is at least equal to the 12 kb length of the M13mpl8 
substrate. 

The DnaE sequence does not show homology to an exonuclease, implying that 
it may have no associated nuclease activity. The DnaE preparation was examined for 
the presence of a 3'-5' exonuclease (Fig. 2 IE). The DnaE and PolC polymerases were 
each incubated with a 5' 32P-Iabeled oligonucleotide, followed by analysis in a 
sequencing gel. The result showed no degradation of the oligonucleotide by DnaE. 
Pole is a known 3*-5' exonuclease and it digests the end-labeled oligonucleotide as 
expected. 

Gram positive PolC is known to be inhibited by the antibiotic 
hydroxyphenylaza-uracil ("HPUra") and its derivatives. In Fig. 2 IF, the PoIC tSS', p 
and DnaE were tested for inhibition of synthesis on SSB coated primed M13mpl 8 
ssDNA by an HPUra derivative, trimethylanilino-uracil ("TMALT^. The Po1C t65' p 
enzyme was prevented from forming the RFII product by TMAU. In contrast, the 
DnaE polymerase was not affected by TMAU in the presence of T56yp (nor in the 
absence of xdSVp, not shown). 

Although the invention has been described in detail for the purpose of 
illustration, it is understood that such detail is solely for that purpose, and variations 
can be made therein by those skilled in the art without departing from the spirit and 
scope of the invention which is defined by the following claims. 
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WHAT IS CLAIMED : 

1 . An isolated DNA molecule from a Gram positive bacterium, 
the isolated DNA molecule comprising a coding region from a polC gene, a dnaE 

5 gene, a holA gene, a holB gene, a dnaX gene, a dnaN gene, a ssb gene, a dnaG gene, or 
a dnaB gene. 

2. The isolated DNA molecule according to claim 1, wherein the 
DNA molecule comprises the coding region from the polC gene, 

10 

3. The isolated DNA molecule according to claim 2, wherein the 
Gram positive bacterium is Streptococcus pyogenes. 

4. An isolated DNA molecule according to claim 3, wherein the 
15 DNA molecule encodes an amino acid sequence comprising SEQ. ID. No. 18. 

5. The isolated DNA molecule according to claim 4, wherein the 
DNA molecule comprises a nucleotide sequence of SEQ. tt). No. 1 7. 

20 6. The isolated DNA molecule according to claim 2, wherein the 

DNA molecule hybridizes to a nucleic acid molecule of SEQ. ID. No. 17 under 
stringent conditions characterized by use of a hybridization buffer comprising 0.9M 
SSC buffer at a temperature of 37''C. 

25 7, The isolated DNA molecule according to claim 1, wherein the 

DNA molecule comprises the coding region from the dnaE gene. 

8. The isolated DNA molecule according to claim 7, wherein the 
Gram positive bacterium is Streptococcus pyogenes. 



30 



9. The isolated DNA molecule according to claim 8, wherein the 
DNA molecule encodes an amino acid sequence comprising SEQ. ID. No. 20. 
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10. The isolated DNA molecule according to claim 9, wherein the 
DNA molecule comprises a nucleotide sequence of SEQ. ID. No. 19. 

1 1 . The isolated DNA molecule according to claim 7, wherein the 
5 DNA molecule hybridizes to a nucleic acid molecule of SEQ. ID. No. 19 under 

stringent conditions characterized by use of a hybridization buffer comprising 0.9M 
SSC buffer at a temperature of ST'^C. 



1 2. The isolated DNA molecule according to claim 1 , wherein the 
10 DNA molecule comprises the coding region from the holA gene. 

13. The isolated DNA molecule according to claim 12, wherein the 
Gram positive bacterium is Streptococcus pyogenes. 

15 14. The isolated DNA molecule according to claim 13, wherein the 

DNA molecule encodes an amino acid sequence comprising SEQ. ED. No. 22. 



15. The isolated DNA molecule according to claim 14, wherein the 
DNA molecule comprises a nucleotide sequence of SEQ. ID. No. 21 . 

20 

16. The isolated DNA molecule according to claim 12, wherein the 
DNA molecule hybridizes to a nucleic acid molecule of SEQ. ID. No. 21 under 
stringent conditions characterized by use of a hybridization buffer comprising 0.9M 
SSC buffer at a temperature of 37°C. 

25 

17. The isolated DNA molecule according to claim 12, wherein the 
Gram positive bacterium is Staphylococcus aureus. 



1 8. The isolated DNA molecule according to claim 1 7, wherein the 
30 DNA molecule encodes an amino acid sequence comprising SEQ. ID. No. 12. 

1 9. The isolated DNA molecule according to claim 1 8, wherein the 
DNA molecule comprises a nucleotide sequence of SEQ. ID. No. 1 1 . 
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20. The isolated DNA molecule according to claim 12, wherein the 
DNA molecule hybridizes to a nucleic acid molecule of SEQ. ID. No. 1 1 under 
stringent conditions characterized by use of a hybridization buffer comprising 0.9M 
SSC buffer at a temperature of 

2 1 . The isolated DNA molecule according to claim 1 , wherein the 
DNA molecule comprises the coding regiong from the holB gene. 

22. The isolated DNA molecule according to claim 21 , wherein the 
Gram positive bacterium is Streptococcus pyogenes, 

23. The isolated DNA molecule according to claim 22, wherein the 
DNA molecule encodes an amino acid sequence comprising SEQ. ED. No. 24. 

24. The isolated DNA molecule according to claim 23, wherem the 
DNA molecule comprises a nucleotide sequence of SEQ. ID. No. 23. 

25. The isolated DNA molecule according to claim 2 1 , wherein the 
DNA molecule hybridizes to a nucleic acid molecule of SEQ. ID. No. 23 under 
stringent conditions characterized by use of a hybridization buffer comprising 0.9M 
SSC buffer at a temperature of 37°C. 



26. The isolated DNA molecule according to claim 21, wherein the 
Gram positive bacterium is Staphylococcus aureus. 

27. The isolated DNA molecule according to claim 26, wherein the 
DNA molecule encodes an amino acid sequence comprising SEQ. E). No. 14. 

28. The isolated DNA molecule according to claim 27, wherein the 
DNA molecule comprises a nucleotide sequence of SEQ. ID. No. 13. 
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29. The isolated DNA molecule according to claim 21 , wherein the 
DNA molecule hybridizes to a nucleic acid molecule of SEQ. ID. No. 13 under 
stringent conditions characterized by use of a hybridization buffer comprising 0.9M 
SSC buffer at a temperature of 37X. 

30. The isolated DNA molecule according to claim 1, wherein the 
DNA molecule comprises the coding region from the dnaX g&no. 

3 1 . The isolated DNA molecule according to claim 30, wherein the 
Gram positive bacterium is Streptococcus pyogenes. 

32. The isolated DNA molecule according to claim 31, wherein the 
DNA molecule encodes an amino acid sequence comprising SEQ. ID. No. 26. 

33. The isolated DNA molecule according to claim 32, wherein the 
DNA molecule comprises a nucleotide sequence of SEQ. ED. No. 25. 

34. The isolated DNA molecule according to claim 30, wherein the 
DNA molecule hybridizes to a nucleic acid molecule of SEQ. ID. No. 25 under 
stringent conditions characterized by use of a hybridization buffer comprising 0.9M 
SSC buffer at a temperature of 37X. 

35. The isolated DNA molecule according to claim 1, wherein the 
DNA molecule comprises the coding region from the dnaN gene, 

36. The isolated DNA molecule according to claim 35, wherein the 
Gram positive bacterium is Streptococcus pyogenes, 

37. The isolated DNA molecule according to claim 36, wherein the 
DNA molecule encodes an amino acid sequence comprising SEQ. ID. No. 28. 

38. The isolated DNA molecule according to claim 37, wherein the 
DNA molecule comprises a nucleotide sequence of SEQ. ID. No. 27. 
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39. The isolated DNA molecule according to claim 35, wherein the 
DNA molecule hybridizes to a nucleic acid molecule of SEQ. ID. No. 27 under 
stringent conditions characterized by use of a hybridization buffer comprising 0.9M 
SSC buffer at a temperature of 37*^0. 

40. The isolated DNA molecule according to claim 1 , wherein the 
DNA molecule comprises the coding region from the ssb gene. 

41 . The isolated DNA molecule according to claim 40, wherein the 
Gram positive bacteriimi is Streptococcus pyogenes, 

42. The isolated DNA molecule according to claim 41 , wherein the 
DNA molecule encodes an amino acid sequence comprising SEQ. ED. No. 30. 

43. The isolated DNA molecule according to claim 42, wherein the 
DNA molecule comprises a nucleotide sequence of SEQ. ID. No. 29. 

44. The isolated DNA molecule according to claim 40, wherein the 
DNA molecule hybridizes to a nucleic acid molecule of SEQ. ID. No. 29 imder 
stringent conditions characterized by use of a hybridization buffer comprising 0.9M 
SSC buffer at a temperature of 37°C. 



45. The isolated DNA molecule according to claim 1 , wherein the 
DNA molecule comprises the coding region from the dnaG gene. 

46. The isolated DNA molecule according to claim 45, wherein the 
Gram positive bacterium is Streptococcus pyogenes, 

47. The isolated DNA molecule according to claim 46, wherein the 
DNA molecule encodes an amino acid sequence comprising SEQ. ID. No. 32. 
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48. The isolated DNA molecule according to claim 47, wherein the 
DNA molecule comprises a nucleotide sequence of SEQ. ID. No, 31. 

49. The isolated DNA molecule according to claim 45, wherein the 
DNA molecule hybridizes to a nucleic acid molecule of SEQ. ID. No. 31 under 
stringent conditions characterized by use of a hybridization buffer comprising 0.9M 
SSC buffer at a temperature of 37°C. 

50. The isolated DNA molecule according to claim 1 , wherein the 
DNA molecule comprises the coding region from the dnaB gene. 

5 1 . The isolated DNA molecule according to claim 50, wherein the 
Gram positive bacterium is Streptococcus pyogenes, 

52. The isolated DNA molecule according to claim 5 1, wherein the 
DNA molecule encodes an amino acid sequence comprising SEQ. ID. No. 34. 

53. The isolated DNA molecule according to claim 52, wherein the 
DNA molecule comprises a nucleotide sequence of SEQ. ID. No. 33. 

54. The isolated DNA molecule according to claim 50, wherein the 
DNA molecule hybridizes to a nucleic acid molecule of SEQ. ID. No. 33 under 
stringent conditions characterized by use of a hybridization buffer comprising 0.9M 
SSC buffer at a temperature of 37°C. 

55. An expression system comprising an expression vector into 
which is inserted a heterologous DNA molecule according to claim 1. 

56. The expression system according to claim 55, wherein the 
heterologous DNA molecule is in sense orientation and correct reading frame. 

57 A host cell comprising a heterologous DNA molecule 
according to claim 1 . 
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5 



58. An isolated protein or polypeptide from a Gram positive 
bacterium, wherein the isolated protein or polypeptide is alpha-large, alpha-small, 
delta, delta prime, tau, beta, SSB, DnaG, or DnaB. 

59. The isolated protein or polypeptide according to claim 58, 
wherein the isolated protein or polypeptide is alpha-large. 



60. The isolated protein or polypeptide according to claim 59, 
10 wherein the Gram positive bacterium is Streptococcus pyogenes, 

61. The isolated protein or polypeptide according to claim 60, 
wherein the alpha-large protein or polypeptide comprises an amino acid sequence of 
SEQ. DD. No. 18. 

15 

62. The isolated protein or polypeptide according to claim 58, 
wherein the isolated protein or polypeptide is alpha-small. 

63. The isolated protein or polypeptide according to claim 62, 
20 wherein the Gram positive bacterium is Streptococcus pyogenes, 

64. The isolated protein or polypeptide according to claim 63, 
wherein the alpha-small protein or polypeptide comprises an amino acid sequence of 
SEQ. ID. No. 20. 



25 



65. The isolated protein or polypeptide according to claim 58, 
wherein the isolated protein or polypeptide is delta. 



30 



66. The isolated protein or polypeptide according to claim 65, 
wherein the Gram positive bacterium is Streptococcus pyogenes. 
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67. The isolated protein or polypeptide according to claim 66, 
wherein the delta protein or polypeptide comprises an amino acid sequence of SEQ. 
ED. No. 22. 

68. The isolated protein or polypeptide according to claim 65, 
wherein the Gram positive bacterium is Staphylococcus aureus. 

69. The isolated protein or polypeptide according to claim 68, 
wherein the delta protein or polypeptide comprises an amino acid sequence of SEQ. 
ID. No. 12. 

70. The isolated protein or polypeptide according to claim 58, 
wherein the isolated protein or polypeptide is delta prime. 

7 1 . The isolated protein or polypeptide according to claim 70, 
wherein the Gram positive bacterium is Streptococcus pyogenes. 

72. The isolated protein or polypeptide according to claim 71, 
wherein the deha prime protein or polypeptide comprises an amino acid sequence of 
SEQ. ID. No. 24. 

73. The isolated protein or polypeptide according to claim 70, 
wherein the Gram positive bacterium is Staphylococcus aureus. 

74. The isolated protein or polypeptide according to claim 73, 
wherein the delta prime protein or polypeptide comprises an amino acid sequence of 
SEQ. ID. No. 14. 

75. The isolated protein or polypeptide according to claim 58, 
wherein the isolated protein or polypeptide is tau. 



76. The isolated protein or polypeptide according to claim 75, 
wherein the Gram positive bacterium is Streptococcus pyogenes. 
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77. The isolated protein or polypeptide according to claim 76, 
wherein the tau protein or polypeptide comprises an amino acid sequence of SEQ. K). 
No, 26. 

78. The isolated protein or polypeptide according to claim 58, 
wherein the isolated protein or polypeptide is beta. 

79. The isolated protein or polypeptide according to claim 78, 
wherein the Gram positive bacterium is Streptococcus pyogenes. 

80. The isolated protein or polypeptide according to claim 79, 
wherein the beta protein or polypeptide comprises an amino acid sequence of SEQ. 
ID. No. 28. 



8 1 . The isolated protein or polypeptide according to claim 58, 
wherein the isolated protein or polypeptide is SSB. 

82. The isolated protein or polypeptide according to claim 81, 
wherein the Gram positive bacterium is Streptococcus pyogenes. 

83. The isolated protein or polypeptide according to claim 82, 
wherein SSB comprises an amino acid sequence of SEQ. ID. No. 30. 

84. The isolated protein or polypeptide according to claim 58, 
wherein the isolated protein or polypeptide is DnaG. 

85. The isolated protein or polypeptide according to claim 84, 
wherein the Gram positive bacterium is Streptococcus pyogenes. 



86. The isolated protein or polypeptide according to claim 85, 
wherein the DnaG protein or polypeptide comprises an amino acid sequence of SEQ. 
ID. No. 32. 
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87. The isolated protein or polypeptide according to claim 58, 
wherein the isolated protein or polypeptide is DnaB. 

88. The isolated protein or polypeptide according to claim 87, 
wherein the Gram positive bacterium is Streptococcus pyogenes, 

89. The isolated protein or polypeptide according to claim 88, 
wherein the DnaB protein or polypeptide comprises an amino acid sequence of SEQ. 
ED. No, 34. 



90. A method of identifying compounds which inhibit the activity 
of a polymerase product of polC or dnaE comprising: 

forming a reaction mixture comprising a primed DNA molecule, a 
polymerase product of polC or dnaE, a candidate compound, a dNTP, and optionally 
either a beta subunit, a tau complex, or both the beta subunit and the tau complex, 
wherein at least one of the polymerase product oipolC or dnaE, the beta subunit, the 
tau complex, or a subunit or combination of subunits thereof is derived from a 
Eubacteria other than Escherichia coli; 

subjecting the reaction mixture to conditions effective to achieve 
nucleic acid polymerization in the absence of the candidate compound; 

analyzing the reaction mixture for the presence or absence of nucleic 
acid polymerization extension products; and 

identifying the candidate compound in the reaction mixture where there 
is an absence of nucleic acid polymerization extension products. 



9 1 . The method according to claim 90, wherein the polymerase 
product of polC or dnaE is from a Streptococcus bacterium or a Staphylococcus 
bacterium. 
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FIGURE 3 
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SEQUENCE LISTING 
<110> The Rockefeller University 

<12 0> DNA REPLICATION PROTEINS OF GRAM POSITIVE BACTERIA AND 
THEIR USE TO SCREEN FOR CHEMICAL INHIBITORS 

<130> 22221/1022 

<140> 
<141> 

<150> 60/146,178 
<151> 1999-07-29 

<160> 84 

<170> PatentIn Ver. 2.1 

<210> 1 
<211> 3195 
<212> DNA 

<213> Staphylococcus aureus 
<400> 1 

atggtggcat atttaaatat tcatacggct tatgatttgt taaattcaag cttaaaaata 60 

gaagatgccg taagacttgc tgtgtctgaa aatgttgatg cacttgccat aactgacacc 120 

aatgtattgt atggttttcc taaattttat gatgcatgta tagcaaataa cattaaaccg 180 

atttttggta tgacaatata tgtgacaaat ggattaaata cagtcgaaac agttgttcta 240 

gctaaaaata atgatggatt aaaagatttg tatcaactat catcggaaat aaaaatgaat 300 

gcattagaac atgtgtcgtt tgaattatta aaacgatttt ctaacaatat gattatcatt 360 

tttaaaaaag tcggtgatca acatcgtgat attgtacaag tgtttgaaac ccataatgac 420 

acatatatgg accaccttag tatttcgatt caaggtagaa aacatgtttg gattcaaaat 480 

gtttgttacc aaacacgtca agatgccgat acgatttctg cattagcagc tattagagac 540 

aatacaaaat tagacttaat tcatgatcaa gaagattttg gtgcacattt tttaactgaa 600 

aaggaaatta atcaattaga tattaaccaa gaatatttaa cgcaggttga tgttatagct 660 

caaaagtgtg atgcagaatt aaaatatcat caatctctac ttcctcaata tgagacacct 720 

aatgatgaat cagctaaaaa atatttgtgg cgtgtcttag ttacacaatt gaaaaaatta 780 

gaacttaatt atgacgtcta tttagagcga ttgaaatatg agtataaagt tattactaat 840 

atgggttttg aagattattt cttaatagta agtgatttaa tccattatgc gaaaacgaat 900 

gatgtgatgg taggtcctgg tcgtggttct tcagctggct cactggtcag ttatttattg 960 

ggaattacaa cgattgatcc tattaaattc aatctattat ttgaacgttt tttaaaccca 1020 

gaacgtgtaa caatgcctga tattgatatt gactttgaag atacacgccg agaaagggtc 1080 

attcagtacg tccaagaaaa atatggcgag ctacatgtat ctggaattgt gactttcggt 1140 

catctgcttg caagagcagt tgctagagat gttggaagaa ttatggggtt tgatgaagtt 1200 

acattaaatg aaatttcaag tttaatccca cataaattag gaattacact tgatgaagca 1260 

tatcaaattg acgattttaa agagtttgta catcgaaacc atcgacatga acgctggttc 1320 

agtatttgta aaaagttaga aggtttacca agacatacab ctacacatgc ggcaggaatt 13 80 
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attattaatg accatccatt atatgaatat 
ttaacgcaat ggacaatgac tgaagccgaa 
gggttgagaa acttatcgat tattcatcaa 
attaatattg atatcgaaaa gattccgttt 
caaggagata cgactggcat attccaatta 
aaattaaagc cggaacactt tgaagatatt 
ccaatggaag aaattccaac ttacattaca 
ttacatccgc atttagaacc tatattaaaa 



gcccctttaa cgaaagggga tacaggatta 1440 

cgtattgggt tattaaaaat agattttcta 1500 

atcttaacac aagtcaaaaa agatttaggt 1560 

gatgatcaaa aagtgtttga attgttgtcg 1620 

gagtctgacg gtgtaagaag tgtattaaaa 1680 

gttgctg.taa cttctttgta tagaccaggt 1740 

agaagacatg atccaagcaa agttcaatat 1800 

aatacttacg gtgttattat ttatcaagag 1860 
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Phe Tyr Asp Ala Cys He Ala Asn Asn lie Lys Pro He Phe Gly Met 
50 55 60 

Thr He Tyr Val Thr Asn Gly Leu Asn Thr Val Glu Thr Val Val Leu 
65 70 75 80 

Ala Lys Asn Asn Asp Gly Leu Lys Asp Leu Tyr Gin Leu Ser Ser Glu 
85 90 95 



He Lys Met Asn Ala Leu Glu His Val Ser Phe Glu Leu Leu Lys Arg 
100 105 110 

Phe Ser Asn Asn Met He He He Phe Lys Lys Val Gly Asp Gin His 
115 120 125 

Arg Asp He Val Gin Val Phe Glu Thr His Asn Asp Thr Tyr Met Asp 
130 135 140 

His Leu Ser He Ser He Gin Gly Arg Lys His Val Trp He Gin Asn 
145 150 155 160 

Val Cys Tyr Gin Thr Arg Gin Asp Ala Asp Thr He Ser Ala Leu Ala 
165 170 175 

Ala He Arg Asp Asn Thr Lys Leu Asp Leu He His Asp Gin Glu Asp 
180 185 190 

Phe Gly Ala His Phe Leu Thr Glu Lys Glu He Asn Gin Leu Asp He 
195 200 205 

Asn Gin Glu Tyr Leu Thr Gin Val Asp Val He Ala Gin Lys Cys Asp 
210 215 220 

r 

Ala Glu Leu Lys Tyr His Gin Ser Leu Leu Pro Gin Tyr Glu Thr Pro 
225 230 235 240 

Asn Asp Glu Ser Ala Lys Lys Tyr Leu Trp Arg Val Leu Val Thr Gin 
245 250 255 



Leu Lys Lys Leu Glu Leu Asn Tyr 
260 

Tyr Glu Tyr Lys Val He Thr Asn 
275 280 

He Val Ser Asp Leu He His Tyr 
290 295 



Asp Val Tyr Leu Glu Arg Leu Lys 
265 270 

Met Gly Phe Glu Asp Tyr Phe Leu 
285 

Ala Lys Thr Asn Asp Val Met Val 
300 
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Gly Pro Gly Arg Qly Ser Ser Ala Gly Ser Leu Val Ser Tyr Leu Leu 
305 310 315 320 

Gly lie Thr Thr lie Asp Pro lie Lys Phe Asn Leu Leu Phe Glu Arg 
325 330 335 

Phe Leu Asn Pro Glu Arg Val Thr Met Pro Asp He Asp He Asp Phe 
340 345 350 

Glu Asp Thr Arg Arg Glu Arg Val He Gin Tyr Val Gin Glu Lys Tyr 
355 360 365 

Gly Glu Leu His Val Ser Gly He Val Thr Phe Gly His Leu Leu Ala 
370 375 380 

Arg Ala Val Ala Arg Asp Val Gly Arg He Met Gly Phe Asp Glu Val 
385 390 395 400 

Thr Leu Asn Glu He Ser Ser Leu He Pro His Lys Leu Gly He Thr 
405 410 415 

Leu Asp Glu Ala Tyr Gin He Asp Asp Phe Lys Glu Phe Val His Arg 
420 425 430 

Asn His Arg His Glu Arg Trp Phe Ser He Cys Lys Lys Leu Glu Gly 
435 440 445 

Leu Pro Arg His Thr Ser Thr His Ala Ala Gly He He He Asn Asp 
450 455 460 

His Pro Leu Tyr Glu Tyr Ala Pro Leu Thr Lys Gly Asp Thr Gly Leu 
465 470 475 480 

Leu Thr Gin Trp Thr Met Thr Glu Ala Glu Arg He Gly Leu Leu Lys 
485 490 495 

He Asp Phe Leu Gly Leu Arg Asn Leu Ser He He His Gin He Leu 
500 505 510 

Thr Gin Val Lys Lys Asp Leu Gly He Asn He Asp He Glu Lys He 
515 520 525 

Pro Phe Asp Asp Gin Lys Val Phe Qlu Leu Leu Ser Gin Gly Asp Thr 
530 535 540 

Thr Gly He Phe Gin Leu Glu Ser Asp Gly Val Arg Ser Val Leu Lys 
545 550 555 560 
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Lys Lou Lys Pro Glu His Phe Glu Asp He Val Ala Val Thr Ser Leu 
565 570 575 

Tyr Arg Pro Gly Pro Met Glu Glu He Pro Thr Tyr He Thr Arg Arg 
580 585 590 

His Asp Pro Ser Lys Val Gin Tyr Leu His Pro His Lou Glu Pro Ho 
595 600 605 
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Gly Ala Phe Asp Ala Pho Gly Lys Thr Arg Ser Thr Leu Leu Gin Ala 
820 825 830 

lie Asp Gin Val Leu Asp Gly Asp Leu Asn lie Glu Gin Asp Gly Phe 
835 840 845 

Leu Phe Asp He Leu Thr Pro Lys Gin Met Tyr Glu Asp Lys Glu Glu 
850 855 860 

Leu Pro Asp Ala Leu He Ser Gin Tyr Glu Lys Glu Tyr Leu Gly Phe 
865 870 875 880 

Tyr Val Ser Gin His Pro Val Asp Lys Lys Phe Val Ala Lys Gin Tyr 
885 890 895 

Leu Thr He Phe Lys Leu Ser Asn Ala Gin Asn Tyr Lys Pro He Leu 
900 905 910 

Val Gin Phe Asp Lys Val Lys Gin He Arg Thr Lys Asn Gly Gin Asn 
915 920 925 

Met Ala Phe Val Thr Leu Asn Asp Gly He Glu Thr Leu Asp Gly Val 
930 935 940 

He Phe Pro Asn Gin Phe Lys Lys Tyr Glu Glu Leu Leu Ser His Asn 
945 950 955 960 

Asp Leu Phe He Val Ser Gly Lys Phe Asp His Arg Lys Gin Gin Arg 
965 970 975 

Gin Leu He He Asn Glu He Gin Thr Leu Ala Thr Phe Glu Glu Gin 
980 985 990 

Lys Leu Ala Phe Ala Lys Gin He He He Arg Asn Lys Ser Gin He 
995 1000 1005 

Asp Met Phe Glu Glu Met He Lys Ala Thr Lys. Glu Asn Ala Asn Asp 
1010 1015 1020 

Val Val Leu Ser Phe Tyr Asp Glu Thr He Lys Gin Met Thr Thr Leu 
1025 1030 1035 1040 

Gly Tyr He Asn Gin Lys Asp Ser Met Phe Asn Asn Phe He Gin Ser 
1045 1050 1055 

Phe Asn Pro Ser Asp He Arg Leu He 
1060 1065 
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<210> 3 
<211> 1698 
<212> DNA 

<213> Staphylococcus aureus 
<400> 3 

ttgaattatc aagccttata tcgtatgtac agaccccaaa gtttcgagga tgtcgtcgga 60 

caagaacatg tcacgaagac attgcgcaat gcgatttcga aagaaaaaca gtcgcatgca 120 

tatattttta gtggtccgag aggtacgggg aaaacgagta ttgccaaagt gtttgctaaa 180 

gcaatcaact gtttaaatag cactgatgga gaaccttgta atgaatgtca tatttgtaaa 240 

ggcattacgc aggggactaa ttcagatgtg atagaaattg atgctgctag taataatggc 300 

gttgatgaaa taagaaatat tagagacaaa gttaaatatg caccaagtga atcgaaatat 360 

aaagtttata ttatagatga ggtgcacatg ctaacaacag gtgcttttaa tgccctttta 420 

aagacgttag aagaacctcc agcacacgct atttttatat tggcaacgac agaaccacat 480 

aaaatccctc caacaatcat ttctagggca caacgttttg attttaaagc aattagccta 540 

gatcaaattg ttgaacgttt aaaatttgta gcagatgcac aacaaattga atgtgaagat 600 

gaagccttgg catttatcgc taaagcgtct gaagggggta tgcgtgatgc attaagtatt 660 

atggatcagg ctattgcttt cggcgatggc acattgacat tacaagatgc cctaaatgtt 720 

acgggtagcg ttcatgatga agcgttggat cacttgtttg atgatattgt acaaggtgac 7 80 

gtacaagcat cttttaaaaa ataccatcag tttataacag aaggtaaaga agtgaatcgc 840 

ctaataaatg atatgattta ttttgtcaga gatacgatta tgaataaaac atctgagaaa 900 

gatactgagt atcgagcact gatgaactta gaattagata tgttatatca aatgattgat 960 

cttattaatg atacattagt gtcgattcgt tttagtgtga atcaaaacgt tcattttgaa 1020 

gtattgttag taaaattagc tgagcagatt aagggtcaac cacaagtgat tgcgaatgta 1080 

gctgaaccag cacaaattgc ttcatcgcca aacacagatg tattgttgca acgtatggaa 114 0 

cagttagagc aagaactaaa aacactaaaa gcacaaggag tgagtgttgc tcctactcaa 1200 

aaatcttcga aaaagcctgc gagaggtata caaaaatcta aaaatgcatt ttcaatgcaa 12 60 

caaattgcaa aagtgctaga taaagcgaat aaggcagata tcaaattgtt gaaagatcat 1320 

tggcaagaag tgattgacca tgcccaaaac aatgataaaa aatcactcgt tagtttattg 13 80 

caaaattcgg aacctgtggc ggcaagtgaa gatcacgtcc ttgtgaaatt tgaggaagag 1440 

atccattgtg aaatcgtcaa taaagacgac gagaaacgta gtagtataga aagtgttgta 1500 

tgtaatatcg ttaataaaaa cgttaaagtt gttggtgtac catcagatca atggcaaaga 1560 

gttcgaacgg agtatttaca aaatcgtaaa aacgaaggcg atgatatgcc aaagcaacaa 1620 

gcacaacaaa cagatattgc tcaaaaagca aaagatcttb tcggtgaaga aactgtacat 1680 
gtgatagatg aagagtga 1698 



<210> 4 
<211> 566 
<212> PRT 

<213> Staphylococcus aureus 
<400> 4 

Leu Asn Tyr Gin Ala Leu Tyr Arg Met Tyr Arg Pro Gin Ser Pho Glu 
15 10 15 

Asp Val Val Gly Gin Olu His Val Thr Lys Thr Leu Arg Asn Ala lie 
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